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© Single domain ligands, receptors comprising said ligands, methods for their production, and use of 
said ligands and receptors. 
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© The present invention relates to single domain 
ligands derived from' molecules in the im- 
munoglobulin (Ig) superfamily. receptors comprising 
at least one such ligand. methods for cloning, am- 
plifying and expressing DNA sequences encoding 
such ligands, preferably using the polymerase chain 
reaction, methods for the use of said DNA se- 
quences in the production of Ig-type molecules and 

J said ligands or receptors, and the use of sard ligands. 

"or receptors in therapy, diagnosis or catalysis. 
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Single Domain Ligands, Receptors comprising said ligands, Methods for their Production, and Use of 

said Ligands and Receptors 



The present invention relates to single domain 
ligands derived from molecules in the im- 
munoglobulin (lg) superfamily. receptors compris- 
ing at least one such ligand, methods for cloning, 
amplifying and expressing DMA sequences encod- 
ing such ligands, methods for the use of said DNA 
sequences in the production of lg-type molecules 
and said ligands or receptors, and the use of said 
ligands or receptors in therapy, diagnosis or cataly- 
sis. 

A list of references is appended to the end of 
the description. The documents listed therein are 
referred to in the description by number, which is 
given in square brackets []. 

The lg supertamily includes not only the Igs 
themselves but also such molecules as receptors 
on lymphoid cells such as T lymphocytes. Im- 
munoglobulins comprise at least one heavy and 
one light chain covalently bonded together. Each 
chain is divided into £ number of domains. At the N 
terminal end of each chain is a variable domain. 
The variable domains on the heavy and light chains 
fit together' to form a binding site designed to 
receive a particular target molecule. In the case of 
Igs. the target molecules are antigens. T-cell recep- 
tors have two chains of equal size, the o and $ 
chains, each consisting of two domains. At the N- 
terminal end of each chain is a variable domain 
and the variable domains -on the o and 0 chains 
are believed to fit together to lorm a binding site 
for target molecules, in this case peptides pre- 
sented by a histocompatibility antigen. The variable 
domains are so called because their amino acid 
sequences vary particularly from one molecule to 
another. This variation in sequence enables the 
molecules to recognise an extremely wide variety 
of target molecules. 

Much research has been carried out on lg 
molecules to determine how the variable domains 
are produced. It has been shown that each variable 
domain comprises a number of areas of relatively 
conserved sequence and three areas of hyper- 
variable sequence. The three hypervariable areas 
are generally known as complementarity determin- 
ing regions (CDRs). 

Crystallographic studies have shown that in 
each variable domain of an lg molecule the CDRs 
are supported on framework areas formed by the 
areas of conserves sequences. The three CDRs 
are brought together by the framework areas and. 
together with the CDRs on the other chain, form a 
pocket in which the target molecule is received. 

Since the adven; of recombinant DNA technol- 
mv thorp h?.^ been much interest in the use of 



' such technology to clone and express lg molecules 
and derivatives thereof. This interest is reflected in 
the numbers of patent applications and other pub- 
lications on the subject. 

5 The earliest work on the cloning and expres- 

sion of full Igs in the patent literature is EP-A-0 120 
694 (Boss). The Boss application also relates to the 
cloning and expression of chimeric antibodies. 
Chimeric antibodies are lg-type molecules in which 

to the variable domains from one lg are fused to 
constant domains from another lg. Usually, the 
variable domains are derived from an lg from one 
species (often a mouse lg) and the constant do- 
mains are derived from an lg from a different 

is species (often a human lg). 

A later European patent application. EP-A-0 
125 023 (Genentech). relates to much the same 
subject as the Boss application, but also relates to 
the production by recombinant DNA technology of 

20 other variations of lg-type molecules. 

EP-A-0 194 276 (Neuberger) discloses not only 
chimeric antibodies of the type disclosed in the 
Boss - application but also chimeric antibodies in 
which some or all of the constant domains have 

25 been replaced by non-lg derived protein se? 
quences. For instance, the heavy chain CH2 and 
CH3 domains may be replaced by protein se- 
quences derived from an enzyme or a protein 
toxin. 

30 EP-A-0 239 400 (Winter) discloses a different 

approach to the production of !g molecules. In this 
approach, only the CDRs from a first type of lg are 
grafted onto a second type of lg in place of its 
normal CDRs. The lg molecule thus produced is 

35 predominantly of the second type, since the CDRs 
form a relatively small part of the whole lg. How* 
ever, since the CDRs are the parts which define 
the specificity of the lg. the lg molecule thus pro- 
duced has its specificity derived from the first lg. 

40 Hereinafter, chimeric antibodies. CDR-grafted 

Igs. the altered antibodies described by Genen- 
tech. and fragments, of such Igs such as F(ab )z 
and Fv fragments are referred to herein as modi- 
fied antibodies. 

45 One of the main reasons for all the activity in 
the lg field using recombinant DNA technology is 
the desire to use Igs in therapy. U is well known 
that, using the hybridoma technique developed by 
Kohler and Milstein. it is possible to produce mon- 

so oclonal antibodies (MAbs) of almost any specificity. 
Thus. MAbs directed against cancer antigens have 
been produced. It is envisaged that these MAbs 
could be covalently attached or fused to toxins to 
orovide "maaic bullets" for use in cancer therapy. 
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MAbs directed agaw-i-fcrrrLal tissue of ceU .surface . 
antigens have also been produced. Labels can be 
attached to these so that they can be used for in 
vivo imaging. 

The major obstacle to the use of such MAbs in 
therapy or in vivo diagnosis is that the vast major- 
ity oi MAbs which are produced are of rodent, in 
particular mouse, origin. It is very difficult to pro- 
duce human MAbs. Since most MAbs are derived 
from non-human species, they are antigenic in hu- 
mans. Thus, administration of these MAbs to hu- 
mans generally results in an anti-lg response being 
mounted by the human. Such a response can 
interiere with therapy or diagnosis, for instance by 
destroying or clearing the antibody quickly, or can 
cause allergic reactions or immune complex hyper- 
sensitivity which has adverse effects on the patient. 

The production of modified Igs has been pro- 
posed to ensure that the Ig administered to a 
patient is as "human" as possible, but still retains 
the appropriate specificity. It is therefore expected 
that modified Igs will be as effective as the MAb 
from which the specificity is derived but at the 
same time not very antigenic. Thus, it should be 
possible to use the modified Ig a reasonable num- 
ber of times in a treatment or diagnosis regime. 

At the level of the gene, it is known that heavy 
chain variable domains are encoded by a 
"rearranged" gene which is built from three gene 
segments: an prearranged" VH gene (encoding 
the N-terminal three framework regions, first two 
complete CDRs and the first pan of the third CDR). 
a diversity (DH)-segment (DH) (encoding the cen- 
tral portion of the third CDR) and a joining segment 
(JH) (encoding the last part of the third CDR and 
the fourth framework region). In the maturation of 
B-cells. the genes rearrange so that each unrear- 
ranged VH gene is linked to one DH gene and one 
JH gene. The rearranged gene corresponds to VH- 
DH-JH. This rearranged gene is linked to a gene 
which encodes the constant portion of the Ig chain. 

For light chains, the situation is similar, except 
that for light chains there is no diversity region. 
Thus light chain variable domains are encoded by 
an Prearranged" VL gene and a JL gene. There 
are two types of light chains, kappa {*) or lambda 
(X). which are built respectively from unrearranged 
V* genes and J* segments, and from unrearranged 
VX genes and JX segments. 

Previous work has shown that it is necessary to 
have two variable domains in association together 
for efficient binding. For example, the associated 
heavy and light chain variable domains were shown 
to contain the antigen binding site [1]. This as- 
sumption is borne out by X*ray crystallographic 
studies of crystallised antibody/antigen complexes 
(2-6] which show that both the heavy and light 
chains of the antibody's variable domains contact 



antigen. The expectation that association of 
heavy and lien; chain variable domains is neces- 
sary for efficient antigen binding underlies work to 
co-secrete these comair.s from bacteria [ij. end to 
s link the domains together by a short section of 
polypeptide as in the single chcin antibodies [8. 9]. 

Binding of isolated heavy and light chains had 
also been detected. However the evidence sug- 
gested strongly that this was a property of heavy or 
io light chain d;mers. Early work, mainly with poly- 
clonal antibodies, in which antibody heavy and light 
chains had been separated under denaturing con- 
ditions [10] suggested that isolated antibody heavy 
chains could cind to protein antigens (11) cr hapten 
15 112). The binding of protein antigen was not 
characterised, but the hapten-binding affinity cf the 
heavy chain fragments was reduced by two orders- 
of magnitude [12] and the number of hapten mol- 
ecules binding were variously estimated as 0.14 or 
20 0.37 [13] or 0.26 [14] per isolated heavy chain. 
Furthermore binding of haptens was shown to be a 
property of clmeric heavy or dimeric light chains 
[14]. Indeed light chain dimers have been crystai- 
lised. lt has teen shown that in light chain dimers 
25 the two chains form a cavity which is able to bind 
to a single moiecule of hapten [15). 

This confirms the assumption that, in order to 
obtain efficient. binding, i; is necessary to have a 
dimer. and preferably a heavy chain/light chain 
30 dimer. containing the respective variable domains. 
This assumption also underlies the teaching of the 
patent references cited above, wherein the inten- 
tion is always to produce dimeric. and preferably 
heavy/light chain dimeric. molecules. 
35 It has now been discovered, contrary to ex- 

pectations, that isolated Ig heavy chain variable 
domains can bind to antigen in a 1:1 ratio and with 
binding constants of equivalent magnitude to those 
of complete antibody molecules. In view of what 
40 was known up until now and in view of the assump- 
tions made by those skilled in the art, this is h.ghly 
surprising. ■ 

Therefore, according to a first aspect of the 
present invention, there is provided a single do- 
« main ligand consisting at least part of the variable 
domain of one chain of a molecule from the Ig 
superfamily. 

Preferably, the ligand consists of the variable 
domain of an Ig light, or. most preferably, heavy 
so chain. 

The ligand may be produced by any known 
technique, for instance by controlled cleavage of Ig 
superfamily molecules or by peptide synthesis. 
However, preferably the ligand is produced by re- 
55 combinant DNA technology. For instance, the gene 
encoding the rearranged gene for a heavy cha.n 
variable domain may be produced, for instance. by 
cloning or gene synthesis, and placed into a suit- 
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able expression vector. The expression vector .s 
then used to transform a compatible host cell 
which is then cultured tc allow the ligand to be 
expressed and. preferably, secreted. 

If desired, the gene for the ligand can be 
mutated to improve the properties of the expressed 
domain, for example to increase the yields of ex- 
pression or the solubility of the ligand. to enable 
the ligand to bind better, or to introduce a second 
site for covalent attachment (by introducing chemi- 
cally reactive residues such as cysteine and his- 
tidine} or non-covalent binding of other molecules. 
In particular it would be desirable to introduce a 
second site for binding to serum components, to 
prolong the residence time of the domains in the 
serum; or for binding to molecules with effector 
functions, such as components of complement, or 
receptors on the surfaces of cells. 

Thus, hydrophobic residues which would nor- 
mally be at the interface of the heavy chain vari- 
able domain with the light chain variable domain 
cculd be mutated to more hydrophilic residues to 
improve solubility; residues in the CDR loops could 
be mutated to improve antigen binding: residues on 
the other loops or parts of the S-sheet could be 
mutated to introduce new binding activities. Muta- 
tions could include single point mutations, multiple 
point mutations or more extensive changes and 
could be introduced by any of a variety of recom- 
binant DNA methods, for example gene synthesis, 
site directed mutagenesis or the polymerase chain 
reaction. 

Since the iigancs of the present invention have 
equivalent binding affinity to that of complete )g 
molecules, the ligands can be used in many of the 
ways as are Ig molecules or fragments. For exam- 
ple. !g molecules have been used in therapy (such 
as in treating cancer, bacterial and viral diseases), 
in diagnosis (such as pregnancy testing), in vac- 
cination (such as in producing anti-idiotypic anti- 
bodies which mimic antigens), in modulation of 
activities of hormones or growth factors, in detec- 
tion, in biosensors and in catalysis. 

tt is envisaged that the small size of the ligands 
of the present invention may confer some advan- 
tages over complete antibodies, for example, in 
neutralising the activity of low molecule/ weight 
drugs (such as digoxin) and allowing their filtration 
from the kidneys with drug attached; in penetrating 
tissues and tumours; in neutralising viruses by 
binding to smalt conserved regions on the surfaces 
of viruses such as the "canyon" sites of viruses 
(16]; in high resolution epitope mapping of proteins; 
end in vaccination by ligands which mimic anti- 
gens. 

The present invention also provides receptors 
comprising a ligand according to the first aspect of 
the invention linked to one or more of an effector 



molecule, a label, e surface, or one or more other 
ligands having the same or different specificity. 

A receptor comprising a licand linked to an 
effector molecule may be of use in therapy. The 

5 effector molecuie m2y be a toxin, such as ricin or 
pseudomonas exotoxin, an enzyme which is able to 
activate a prodrug, a binding partner or a radio- 
isotope. The radio-isotope may be directly linked to 
the ligand or may be attached tnereto by a cheiat- . 

io ing structure which is directly linked to the ligand. 
Such ligands with attached isotopes are much 
smaller than those based on Fv fragments, and 
could penetrate tissues and access tumours more 
readily. 

is A receptor comprising a ligand linked to a label 

may be of use in diagnosis. The label may be a 
heavy metal atom or a radio-isotope, in which case 
the receptor can be used for in vivo imaging using 
X-ray or other scanning apparatus. The metal atom 
20 or radio-isotope may be attached to the ligand 
either directly or via a chelating structure directly 
linked to the ligand. For in vitro diagnostic testing, 
the label may be a heavy metal atom, a radio- 
isotope, an enzyme, a fluorescent or coloured mot- 
25 ecule or a protein or peptide tag which can be 
detected by an antibody, an antibody fragment or 
another protein. Such receptors would be used in 
any of the known diagnostic tests, such as ELISA 
or fluorescence-linked assays. 
jo A receptor comprising a ligand linked to a 

surface, such as a chromatography medium, could 
be used for purification of other molecules by affin- 
ity chromatography. Linking of ligands to cells, for 
example to the outer membrane proteins of E. colt • 
35 or to hydrophobic tails which localise the ligands in 
the cell membranes, could allow a simple diagnos- 
tic test in which the bacteria or cells would agglu- 
tinate in the presence of molecules bearing mul- 
tiple sites for binding the ligand(s). 
<o Receptors comprising at least two ligands can 
be used, for instance, in diagnostic tests. The first 
ligand will bind to a test antigen and the second 
ligand will bind to a reporter molecule, such as an 
enzyme, a fluorescent dye. a coloured dye. a 
45 radio-isotope or a coloured-, fluorescentty- or 
radio-labetled protein. 

Alternatively, such receptors may be useful m 
increasing the binding to an antigen. The first 
ligand will bind to a first epitope of the antigen and 
so the second ligand will bind to a second ep.tope. 
Such receptors may also be used for increas.ng 
the affinity and specificity of binding to different 
antigens in close proximity on the surface of cells. 
• The first ligand will bind to the first antigen and the 
55 second epitope to the second antigen: strong bind- 
ing will depend on the co-expression of the epi- 
topes on the surface of the cell. This may be useful 
in therapy of tumours, which can have elevated 
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expression, of sers^L. surface .markers,, Jurfrw . 
ligands could be adaed to further improve" binding 
or specificity. Moreover, the use of strings of 
ligands. with the seme or multiple specificities, 
creates a larger molecule which is less readily 
filtered from the circulation by the kidney. 

For vaccination with ligands which mimic anti- 
gens, the use of strings cf ligands may prove more 
effective than single ligands. due to repetition cf 
the immunising epitopes. 

If desired, such receptors with multiple ligands 
could include effector molecules or labels so that 
they can be used in therapy or diagnosis as de- 
scribed above. 

The ligand may be linked to the other part of 
the receptor by any suitable means, for instance by 
covalent or non-covalent chemical linkages. How- 
ever, where the receptor comprises a ligand and 
another protein molecule, it is preferred that they 
are produced by recombinant DNA technology as a 
fusion product. If necessary, a linker peptide se- 
quence can be placed between the ligand and the 
other protein molecule to provide flexibility. 

The basic techniques for manipulating Ig mol- 
ecules by recombinant DNA technology are de- 
scribed in the patent references cited above. These 
may be adapted in order to allow for the production 
of ligands and receptors according to the invention 
' by means of recombinant DNA technology. 

Preferably, where the ligand is to be used for 
in vivo diagnosis or therapy in humans, it is 
humanised, for instance by CDR replacement as 
described in EP-A-0 239 400. 

In order to obtain a DNA sequence encoding a 
ligand. it is generally necessary, firstly to produce a 
hybridoma which secretes an appropriate MAb. 
This can be 2 very time consuming method. Once 
an immunised animal has been produced, it is 
necessary to fuse separated spleen cells with a 
suitable myeloma cell line, grow up the cell lines 
thus produced, select appropriate lines, redone the 
selected lines and reseleci. This can take some 
long time. This problem also applies to the produc-. 
tion of modified Igs. 

A further problem with the production of 
ligands. and also receptors according to the inven- 
tion and modified Igs. by recombinant DNA tech- 
nology is the cloning of the variable domain encod- 
ing sequences Irom the hybridoma which produces 
the MAb from which the specificity is to be derived. 
This can be a relatively long method involving the 
production of a suitable probe, construction of a 
clone library from cDNA or genomic DNA. exten- 
sive probing of the clone library, and manipulation 
of any isolated clones to enable the cloning into a 
suitable expression vector. Due to the inherent 
variability of the DNA sequences encoding Ig vari- 
able domains, it has not previously been possible 



.-.12. avoid sucn time consuming work. It is therefore 
* "a further aim of the present invention to provide a 
method which enables substantially any sequence 
encoding an Ig superfamily molecule va-iable do- 
5 main (ligand) to be cloned in a reasonable period 
of time. 

According to another aspect of the present 
invention therefore, there is provided s method of 
cloning a sequence (the target sequence) which 
ro encodes at least part of the variable domain of an 
Ig superfamily molecule, which method comprises: 
(a) providing a sample of double stranded 
(ds) nucleic acid which contains the target se- 
quence; 

i5 (b) denaturing the sample so as to separate 

the two strands; 

(c) annealing to the sample a forward and a 
back oligonucleotide primer! the forward primer be; 
ing specific for a sequence at or adjacent the 3 

20 end of the sense strand of the target sequence, the 
back primer being specific for a sequence at or 
adjacent the 3* end of the antisense strand of the 
target sequence, under conditions which allow the 
primers to hybridise to the nucleic acid at or adja- 

rs cent the target sequence; 

(d) treating the annealed sample with a DNA 
polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which 
cause primer extension to take place; and 

30 (e) denaturing the sample under conditions 

such that the extended primers become separated 
from the target sequence. 

Preferably, the method of the present invention - 
further includes the step (f) of repeating steps <c) to 

as (e) on the denatured mixture a plurality of times. 

Preferably, the method of the present invention 
is used to done complete variable domains from Ig 
molecules, most preferably from tg heavy chains. 
In the most preferred instance, the method will 

-do produce a DNA sequence encoding a ligand ac- 
cording to the present invention. 

In step (c) recited above, the forward primer 
becomes annealed to the sense strand of the target 
sequence at or adjacent the 3 end of the strand. In 

<s a similar manner, the back primer becomes an- 
nealed to the antisense strand of the target se- 
quence at or adjacent the 3' end of the strand. 
Thus, the forward primer anneals at or adjacent the 
region of the ds nucleic acid which encodes the C 

so terminal end of the variable region or domain. 
Similarly, the back primer anneals at or adjacent 
the region of the ds nucleic acid which encodes the 
N-terminal end of the variable domain. 

In step <d). nucleotides are added onto the 3 
55 end of the forward and back primers in accordance 
with the sequence of the strand to which they are 
annealed. Primer extension will continue in this 
manner until stopped by the beginning of the de- 
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naturinc step (e). It must therefore be ensured that 
step (d) is carried out for a long enough time to 
ensure that the primers are extended so that the 
extended strands totally overlap one another. 

In step (e). the extended primers are separated 
from the ds nucleic acid. The ds nucleic acid can 
then serve again as a substrate to which further 
primers can anneal. Moreover, the extended prim- 
ers themselves have the necessary complementary 
sequences to enable the primers to anneal thereto. 

During further cycles, if step (f) is used, the 
amount of extended primers will increase exponen- 
tially so that at the end of the cycles there will be a 
large quantity of cDNA having sequences com- 
plementary to the sense and antisense strands of 
the target sequence. Thus, the method of the 
present invention will result in the accumulation of 
a large quantity of cDNA which can form ds cDNA 
) encoding at least part of the variable domain. 

As will be apparent to the skilled person, some 
of the steps in the method may be carried out 
simultaneously or sequentially as desired. 

The forward and back primers may be pro- 
vided as isolated oligonucleotides, in which case 
only two oligonucleotides will be used. However, 
alternatively the forward and back primers may 
each be supplied as a mixture of closely related 
oligonucleotides. For instance, it may be found that 
at a particular point in the sequence to which the. 
primer is to anneal, there is the possibility of 
nucleotide variation. In this case a primer may be 
used for each possible nucleotide variation. Fur- 
thermore it may be possible to use two or more 
sets of "nested" primers in the method to enhance 
the specific cloning of variable region genes. 

The method described above is similar to the 
method described by Saiki et a!.' [17]. A similar 
) method is also used in the methods described in 
EP-A-0 200 362. In both cases the method de- 
scribed is carried out using primers which are 
known to anneal efficiently to the specified 
nucleotide sequence. In neither of these disclo- 
sures was it suggested that the method could be 
used to clone tg parts of variable domain encoding 
sequences, where the target sequence contains 
inherently highly variable areas. 

The ds nucleic acid sequence used in the 
method of the present invention may be derived 
from mRNA. For instance. RNA may be isolated in 
known manner from a cell or cell line which is 
known to produce Igs. mRNA may be separated 
from other RNA by oligo-dT chromatography. A 
complementary strand of cDNA may then be syn- 
thesised on the mRNA template, using reverse 
transcriptase and a suitable primer, to yield an 
RNA'DNA heteroduplex. A second strand of DNA 
can be made in one of several ways, for example, 
by priming with RNA fragments of the mRNA 



strand (made by incubating RNA'DNA heteroduplex 
with RNase H} and using DNA polymerase, or by 
priming with a synthetic olipdeoxynucleotide prim- 
er which anneals to the 3 end of the first strand 
5 and using DNA polymerase. It has been found that 
the method of the present invention can be carried 
out using os cDNA prepared in this way. 

When making such ds cDNA. it is possible to 
use a forward primer which anneals to a sequence 
to in the CM domain (for a heavy chain variable 
domain) or the Cx or C* domain (for a light chain 
variable domain). These will be located in close 
enough proximity to the target sequence to allow 
the sequence to be clcned. 
15 The back primer may be one which anneals to 

a sequence at the N-terminal end of the VH1. V* or 
VX domain. The back primer may consist of a 
plurality of primers having a variety of sequences 
designed to be complementary to the various fam- 
20 ilies of VHLVx or VX sequences known. Alter- 
natively the back primer may be a single primer 
having a consensus sequence derived from all the 
families of variable region genes. 

Surprisingly, it has been found that the method 
25 of the present invention can be carried out using 
genomic DNA. If genomic DNA is used, there is a 
very large amount of DNA present, including actual 
coding "sequences. , introns and untranslated se- 
quences between genes. Thus, there is consider- 
30 able scope for non-specific annealing under the 
conditions used. However, it has surprisingly been 
found that there is very little non-specific annealing. 
It is therefore unexpected that it has proved possi- 
ble to clone the genes of Ig-variable domains from 
J5 genomic DNA. 

Under some circumstances the use of genomic 
DNA may prove advantageous compared with use 
of mRNA, as the mRNA is readily degraded, and 
especially difficult to prepare from clinical samples 
ao of human tissue. 

Thus, in accordance with an aspect 'Of the 
present invention, the ds nucleic acid used in step 
(a) is genomic DNA. 

When using genomic DNA as the ds nucleic 
45 acid source, it will not be possible to use as the 
forward primer an oligonucleotide having a se- 
quence complementary to a sequence in a con- 
stant domain. This is because, in genomic DNA, 
the constant domain genes are generally separated 
so from the variable domain genes by a considerable 
number of base pairs. Thus, the site of annealing 
would be too remote from the sequence to be 
cloned. 

It should be noted that the method of the 
55 present invention can be used to clone both rear- 
ranged and unrearranged variable domain se- 
quences from genomic DNA. It is known that in 
germ line oenomic DNA the three genes, encoding 
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the VH, DH and -^-respectively, are sepaj£i2d . 
from one another by considerable numbers 'ol base 
pairs. On maturation of the immune response, 
these genes are rearranged so that t. K .e VH. DH and 
JH genes are fused together to provide the gene 
encoding the whole variable domain (see Figure l). 
By using a forward primer specHic icr a sequence 
at or adjacent the 3 end of the sense strand of the 
genomic prearranged" VH gene, it is possible to 
clone the Prearranged" VH gene alone, without 
also cloning the DH and JH genes. This can be of 
use in that it will then be possible to fuse the VH 
gene onto pre-cloned or synthetic DH and DH 
genes. In this way. rearrangement of the variable 
domain genes can be carried out in vitro. 

The oligonucleotide primers used in step (c) 
m2y be specifically designed for use with a particu- 
lar target sequence. .In this case, it will be neces- 
sary to sequence at least the 5* and 3 ends of the 
target sequence so that the appropriate 
oligonucleotides can be synthesised. However, the 
present inventors have discovered that it is not 
necessary to use such specifically cesigned prim- 
ers. Instead, it is possible to use a species specific 
general primer or a mixture of such primers (or 
annealing to each end of the target sequence. This 
is not particularly surprising as regards the 3 end 
of the target sequence. It is known that this end of 
the variable domain encoding sequence leads into 
a segment encoding JH which is known to be 
relatively conserved. However, it was surprisingly 
discovered that, within a single species, the se- 
quence at the 5' end of the targe: sequence is 
sufficiently well conserved to enable a species spe- 
cific general prime; or a mixture thereof to be 
designed for the 5 end of the target sequence. 

Therefore according to a preferred aspect of 
the present invention, in step £0 the two primers 
which are used are species specific general prim- 
ers, whether used as single primers or as mixtures 
of primers. This greatly facilitates the cloning of 
any undetermined target sequence since it will 
avoid the need to carry out any sequencing on the 
target sequence in order to produce target 
sequence-specific primers. Thus the method of this 
aspect of the invention provides a general method 
for cloning variable region or domain encoding 
sequences of a particular species. 

Once the variable domain gene has been clon- 
ed using the method described above, it may be 
directly inserted into an expression vector, for in- 
stance using the PCR reaction to paste the gene 
into a vector. 

Advantageously, however, each primer in- 
cludes a sequence including a restriction enzyme 
recognition site. The sequence recognised by the 
restriction enzyme need not be in the part of the 
primer which anneals to tne ds nucleic acid, but 



JUiy be. provided as an extension which does not 
anneal. The use of primers with restriction sites has 
the advantage that the DNA can be cut with at least 
one restriction enzyme which leaves 3 or 5 over- 

5 hanging nucleotides. Such DNA is more readily 
cloned into the corresponding sites on the vectors 
than blunt end fragments taken directly from the 
method. The cs cDNA produced a: the end of the 
cycles will thus be readily insertable into a cloning 

jo vector by use of the appropriate restriction en- 
zymes. Preferably the choice of restriction sites is 
such that the ds cDNA is cloned directly into an 
expression vector, such that the ligand encoded by 
the gene is expressed. In this case the restriction 

15 site is preferably located in the sequence which is 
annealed to the ds nucleic acid. 

Since the primers may not have a sequence 
exactly complementary to the target sequence to 
which it is to be annealed, for instance because oi 

20 nucleotide variations or because of the introduction 
of a restriction enzyme recognition site, it may be 
necessary to adjust the conditions in the annealing 
mixture to enable the primers to anneal to the ds 
nucleic acid. This is well within the competence of 

25 the person skilled in the art and needs no further 
explanation. 

In step (d). any DNA polymerase may be used. 
Such polymerases are known in the art and are 
available commercially. The conditions to be used 
30 with each polymerase are well known and require 
no further explanation here. The polymerase reac- 
tion will need to be carried out in the presence of 
the four nucleoside triphosphates. These and the 
polymerase enzyme may already be present in the 
05 sample or may be provided afresh for each cycle. 

The denaturing step (e) may be carried out. for 
instance, by heating the sample, by use of 
chaotropic agents, such as urea or guanidine. or by 
the use of changes in ionic strength or pH. Prefer- 
40 ably, denaturing is carried out by heating since this 
is readily reversible. Where heating is used to 
carry out the denaturing, it will be. usual to use a 
thermostable DNA polymerase, such as Taq poly- 
merase, since this will not need replenishing at 
<s each cycle. 

tf heating is used to control the method, a 
suitable cycle of heating comprises denaturation at 
about 95* C for about 1 minute, annealing at from 
30' C to 65 *C for about 1 minute and primer 
50 extension at about 75 *C for about 2 minutes. To 
ensure that elongation and renaturation is com- 
plete, the mixture after the final cycle is preferably 
held at about 60* C for about 5 minutes. 

The product ds cDNA may be separated from 
, 5 the mixture for instance by gel electrophoresis us- 
ing agarose gels. However, if desired, the ds cDNA 
may be used in unpurified form and inserted di- 
rectly into a suitable cloning or expression vector 
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by conventional methods. This will be particularly 
easy to accomplish if the primers include restric- 
tion enzyme recognition sequences. 

The method of the present invention may be 
used to make variations in the sequences encoding 
the variable domains. For example this may be 
acheived by using a mixture of related 
oligonucleotide primers as at least one of the prim- 
ers. Preferably the primers are particularly variable 
in the middle of the primer and relatively con- 
served at the 5' and 3' ends. Preferably the ends 
of the primers are complementary to the framework 
regions of the variable domain, and the variable 
region in the middle of the primer covers all or pari 
of a CDR. Preferably a forward primer is used in 
the area which forms the third CDR. If the method 
is carried out using such a mixture of 
oligonucleotides, the product will be a mixture of 
variable domain encoding sequences. Moreover, 
variations in the sequence may be introduced by 
incorporating some mutagenic nucleotide 
triphosphates in step (d). such that point mutations 
are scattered throughout the target region. Alter- 
natively such point mutations are introduced by 
performing a large number of cycles of amplifica- 
tion, as errors due to the natural error rate of the 
DNA polymerase are amplified, particularly when 
using high concentrations of nucleoside 
triphosphates. 

The method of this aspect of the present inven- 
tion has the advantage that it greatly facilitates the 
cloning of variable domain encoding sequences 
directly from mRNA or genomic DNA. This in turn 
will facilitate the production of modified Ig-type 
molecules by any of the prior art methodes re- 
ferred to above. Further, target genes can be clon- 
ed from tissue samples containing antibody pro- 
ducing cells, and the genes can be sequenced. By 
doing this, it will be possible to look directly at the 
immune repertoire of a patient. This 
"fingerprinting*' of a patient's immune repertoire 
could be of use in diagnosis, for instance of auto- 
immune diseases. 

In the method for amplifying the amount of a 
gene encoding a variable domain, a single set of 
primers is used in several cycles of copying via the 
polymerase chain reaction. As a less preferred 
alternative, there is provided a second method 
which comprises steps (a) to (d) as above, which 
further includes the steps of: 

(g) treating the sample of ds cDNA with 
traces of DNAse in the presence of DNA poly- 
merase I to allow nick translation of the DNA; and 

(h) cloning the ds cDNA into a vector. 

if desired, the second method may further in- 
clude the steps of: 

(i) digesting the DNA of recombinant plas- 
mi'ds to reiease DNA fragments containing genes 



encoding variable domains: and 

(j) treating the fragments in 2 further set of 
steps (c) to (h). 

Preferably the fragments are separated from 
5 • the vector and from other fragments of the in- 
correct size by gel electrophoresis. 

The steps (a) to (d) then (g} to (h) can be 
followed once, but preferably the entire cycle (c) to 
(d) and (g) to (j) 'S repeated at least once. In this 
10 way a priming step . in which the genes are spe- 
cifically copied, is followed by a cloning step, in 
which the amount of genes is increased. 

In step (a) the ds cDNA is oerived from mRNA. 
For Ig derived variable domains, the mRNA is 
15 preferably be isolated from lymphocytes which 
have been stimulated to enhance production of 
mRNA. 

in each step (c) the set of primers are prefer- 
ably different from the previous step (c). so as to 
20 enhance the specificity of copying. Thus the sets 
of primers form a nested set. For example, for 
cloning of Ig heavy chain variable domains, the first 
set of primers may be located within the signal 
sequence and constant region, as described by 
25 Larrick et ai.. [18], and the second set of primers 
entirely within the variable region, as described by 
Orlandi et al.. [19]. Preferably the primers of step 
(c) include restriction sites to facilitate subsequent 
cloning. In the last cycle the set of primers used in 
30 step (c) should preferably include restriction sites 
for introduction into expression vectors. In step (g) 
possible mismatches between the primers and the 
template strands are corrected by "nick transla- 
tion". In step (h). the ds cDNA is preferably cleav- 
35 ed with restriction enzymes at sites introduced into 
the primers to facilitate the cloning. 

According to another aspect of the present 
invention the product ds cDNA is cloned directly 
into an expression vector. The host may be pro- 
40 karyotic or eukaryotic. but is preferably bacterial. 
Preferably the choice of restriction sites in the 
primers and in the vector, and other features of the 
vector will allow the expression of complete 
ligands. while preserving all those features of the 
45 amino acid sequence which are typical of the 
(methoded) ligands. For example, for expression of 
the rearranged variable genes, the primers would 
be chosen to allow the cloning of target sequences 
including at least all the three CDR sequences. The 
50 cloning vector would then encode a signal se- 
quence (for secretion of the ligand). and sequences 
encoding the N-terminal end of the first framework 
region, restriction sites for cicning and then the C- 
terminal end of the last (fourth) framework region. 
55 For expression of unrearranged VH genes as 

part of complete ligands. the primers would be 
chosen to allow the cloning of target sequences 
including at least the first two CDRs. The cloning 
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ve:to: coulci then 6=~de signal sequence^h-e-N-. - 
terminal enc of the first framework region, restric- 
tion sites for cloning and then the C-terminat end of 
the third framework region, the third COR anc 
fourth framework region. 

Primers and cloning vectors may likewise be 
devised for expression of single CDRs. particularly 
the third CDR. as parts of complete ligands. The 
advantage of cloning repertoires of single CDRs 
would permit the design of a "universal" set of 
framework regions, incorporating desirable prop- 
erties such as solubility. 

Single ligands could be expressed alone or in 
combination with a complementary variable do- 
main. For example, a heavy chain variable domain 
can be expressed either as an individual domain 
or. if it is expressed with a complementary light 
chain variable domain, as an antigen binding site. 
Preferably the two partners would be expressed in 
the same cell, or secreted from the seme cell, and 
the prctems allowed to associate non-covalently to 
form an Fv fragment. Thus the two genes encoding 
the complementary partners can be placed in tan- 
dem and expressed from a single vector, the vec- 
tor including two sets of restriction sites. 

Preferably the genes are introduced sequen- 
tially: for example the heavy chain variable domain 
can be cloned first and then the light chain variable 
domain. Alternatively the two genes are introduced 
into the vector in a single step, for example by 
using the polymerase chain reaction to paste to- 
gether each gene with any necessary intervening 
sequence, as essentially described by Yon and 
Fried [29]. The two partners could be also ex- 
pressed as a linked protein to produce a single 
chain Fv fragment, using similar vectors to those 
described above. As a further alternative the two 
genes may be placed in two different vectors, for 
example in which one vector is a phage vector and 
the other is a plasmid vector. 

Moreover, the cloned ds cDNA may be in- 
serted into an expression vector already containing 
sequences encoding one or more constant do- 
mains to allow the vector to express Ig-type chains. 
The expression of Fab fragments, for example, 
would have the advantage over Fv fragments that 
the heavy and light chains, would tend to associate 
through the constant domains in addition to the 
variable domains. The final expression product may 
be any of the modified Ig-type molecules referred 
to above. 

The cloned sequence may also be inserted 
into an expression vector so that it can be ex- 
pressed as a fusion protein. The variable domain 
encoding sequence may be linked directly or via a 
linker sequence to a DNA sequence encoding any 
protein effector molecule, such as a toxin, enzyme, 
label or another ligand. The variable domain se* 



. ..ounces may also be linked to proteins on the 
' outer side of bacteria or phage. Thus, the method 
of this aspect of the invention may be used to 
produce receptors according to the invention. 

s According to another aspect of the invention, 

the cloning of ds cDNA directly for expression 
permits the rapid construction of expression librar- 
ies which can be screened for binding activities. 
For Ig heavy and light chain variable genes, the ds 

io cDNA may comprise variable genes isolated as 
complete rearranged genes from the animal, or 
variable genes built from several different sources, 
for example a repertoire of unrearranged VH genes 
combined with a synthetic repertoire of DH and JH 

is genes. Preferably repertoires of genes encoding Ig 
heavy chain variable domains are prepared from 
lymphocytes of animals immunised with an anti- 
gen. 

The screening method may take a range ot 
20 formats well known in the art. For example Ig 
heavy chain variable domains secreted from bac- 
teria may be screened by binding to antigen on a 
solid phase, and detecting the captured domains 
by antibodies. Thus the domains may be screened 
?s by growing the bacteria in liquid culture and bind- 
ing to antigen coated on the suriace of ELlSA 
plates. However, preferably bacterial colonies {or 
phage plaques) which secrete ligands (or modified 
ligands. or ligand fusions with proteins) are 
30 screened for antigen binding on membranes. Either 
the ligands are bound directly to the membranes 
(and for example detected with labelled antigen), or 
captured on antigen coated membranes (and de- 
tected with reagents specific for ligands). The use 
as of membranes offers great convenience in screen- 
ing many clones, and such techniques are well 
known in the art. 

The screening method may also be greatly 
facilitated by making protein fusions with the 
<o ligands. for example by introducing a peptide tag 
which is recognised by an antibody at the N- 
terminal or C-terminat end of the ligand. or joining 
the ligand to an enzyme which catalyses the con- 
version of a colourless substrate to a coloured 
45 product, in the latter case, the binding of ant.gen 
may be detected simply by adding substrate. Alter- 
natively, for ligands expressed and folded correctly 
inside eukaryotic cells, joining of the bgand and a 
domain of a transcriptional activator such as the 
so GAL4 protein of yeast, and joining of antigen to the 
other domain of the GAL4 protein, could form the 
basis for screening binding activities, as described 
by Fields and Song 121). 

The preparation of proteins, or even cells with 
55 multiple copies of the ligands. may improve the 
avidity of the ligand for immobilised ant.gen. and 
hence the sensitivity of the screening method. For 
example, the ligand may be joined to a protem 
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subunit of a muliimeric protein, to a phage coat 
protein or to an outer membrane protein of £. coli 
such as ompA or lamB. Such fusions to phage or 
bacteria! proteins also offers possibilities of select* 
ing bacteria displaying ligands with antigen binding 
activities. For example such bacteria may be 
precipitated with antigen bound to a solid support, 
or may be subjected to affinity chromatography, or 
may be bound to larger cells or particles which 
have been coated with antigen and sorted using a 
fluorescence activated eel! sorter (FACS). The pro- 
teins or peptides fused to the ligands are prefer- 
ably encoded by the vector, such that cloning of 
the ds cDNA repertoire creates the fusion product. 

In addition to screening for binding activities of 
single ligands. it may be necessary to screen for 
binding or catalytic activities of associated ligands. 
for example, the associated lg heavy and light 
chain variable domains. For example, repertoires of 
heavy and light chain variable genes may be clon- 
ed such that two domains are expressed together. 
Only some of the pairs of domains may associate, 
and only some of these associated pairs may bind 
to antigen. The repertoires of heavy and light chain 
variable domains could be cloned such that each 
domain is paired at random. This approach may be 
most suitable for isolation of associated domains in 
which the presence of both partners is required to 
form a cleft. Alternatively, to allow the binding of 
hapten. Alternatively, since the repertoires of light 
chain sequences are less diverse than those of 
heavy chains, a sm2ll repertoire of light chain vari- 
able domains, for example including representative 
members of each family of domains, may be com- 
bined with a large repertoire of heavy chain vari- 
able domains. 

Preferably however, a repertoire of heavy chain 
variable domains is screened first for antigen bind- 
ing in the absence of the light chain partner, and 
then only those heavy chain variable domains bind- 
ing to antigen are combined with the repertoire of 
light chain variable domains. Binding of associated 
heavy and light chain variable domains may be 
distinguished readily from binding of single do- 
mains, for example by fusing each domain to a 
different C-terminal peptide tag which are specifi- 
cally recognised by different monoclonal anti- 
bodies. 

The hierarchical approach of first cloning heavy 
chain variable domains with binding activities, then 
cloning matching light chain variable domains may 
be particularly appropriate for the construction of 
catalytic antibodies, as the heavy chain may be 
screened first for substrate binding. A light chain 
variable domain would then be identified which is 
capable of association with the heavy chain, and 
"catalytic" residues such as cysteine or histidine 
(or prosthetic groups) would be introduced into the 



CDRs to stabilise the transition state or attack the 
substrate, as described by Baldwin and Schultz 
[22J- 

ASthough the binding activities of norvcovalen- 
5 tly associated heavy and light chain variable do- 
mains (Fv .fragments) may be screened, suitable 
fusion proteins may drive the association of the 
variable domain partners. Thus Fab fragments are 
more likely to be associated than the Fv fragments, 
to as the heavy chain variable domain is attached to a 
single heavy chain constant domain., and the light 
chain variable domain is attached to a single light 
chain variable domain, and the two constant do- 
mains associate together. 
I5 Alternatively the heavy and light chain variable 

domains are covalently linked together with a pep- 
tide as in the single chain antibodies, or peptide 
sequences attached, preferably at the C-terminal 
end which will associate through forming. cysteine 
20 bonds or through non-covalent interactions, such as 
the introduction of "leucine zipper" motifs. How- 
ever, in order to isolate pairs of tightly associated 
variable domains, the Fv fragments are preferably 
used. 

25 The construction of Fv fragments iso.ated from 

a repertoire, of variable region genes offers a way 
of building complete antibodies, and an alternative 
to hybridoma technology. For example by attaching 
the variable domains to tight or suitable heavy 
30 chain constant domains, as appropriate, and ex- 
pressing the assembled genes in mammalian cells, 
complete antibodies may be made and should pos- 
sess natural effector functions, such as comple- 
ment lysis. This route is particularly attractive for 
35 the construction of human monoclonal antibodies, 
as hybridoma technology has proved difficult, and 
for example, although human peripheral blood lym- 
phocytes can be immortalised with Epstein Barr 
virus, such hybridomas tend to secrete low affinity 
40 IgM antibodies. . 

Moreover, it is known that immmuriological 
mechanisms ensure that lymphocytes do not gen- 
erally secrete antibodies directed against host pro- 
teins. However it is desirable to make human ant>- 
bodies directed against human proteins, for "exam- 
ple to human cell surface markers to treat cancers, 
or to histocompatibility antigens to treat auto-.m- 
mune diseases. The construction of human anti- 
bodies built from the combinatorial repertoire of 
so heavy and tight chain variable domains may over- 
come this problem, as it will allow human any- 
bodies to be built with specificities which would 
normally have been eliminated. 

The method also offers a new way of making 
55 bispecific antibodies. Antibodies with dual specia- 
lty can be made by fusing two hybndomas of 
different specificities, so as to make a hybr.d anti- 
body with an Fab arm ol one specificity, and the 
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other Fab arm nf p. .cynnti specificity. Howeyerjpe^ . 
yields of the bispsciiic antibody are low. as heavy 
and light chains also find the wrong partners. The 
construction of Fv fragments which are tightly as- 
sociated should preferentially drive the association 
of the correct pairs of heavy with light chains. (It 
would not assist in the correct pairing of the two 
heavy chains with each other.) The improved pro- 
duction of bispecific antibodies would have a vari- 
ety of applications in diagnosis and therapy, as is 
well known. 

Thus the invention provides a species specific 
general oligonucleotide primer or a mixture of such 
primers useful for cloning variable domain encod- 
ing sequences from animals of that species. The 
method allows a single pair or pair of mixtures of 
species specific general primers to be used to 
clone any desired antibody specificity from that 
species. This eliminates the need to carry out any 
sequencing of the target sequence to be cloned 
and the need to design specific primers for each 
specificity to be recovered. 

Furthermore it provides for the construction of 
repertoires of variable genes, for the expression of 
the variable genes directly on cloning, for the 
screening of the encoded domains for binding ac- 
tivities and for the assembly of the domains with 
other variable domains derived from the repertoire. 

Thus the use of the method of the present 
invention will allow for the production of heavy- 
chain variable domains with binding activities and 
variants of these domains. It allows for the produc- 
tion of monoclonal antibodies and bispecific anti- 
bodies, and will provide an alternative to hybridom.2 
technology. For instance, mouse, splenic ds mRNA 
or genomic DNA may be obtained from a hyperim- 
munised mouse. This could be cloned using, the 
method of the present invention and then the clon- 
ed ds DNA inserted into a suitable expression 
vector. The expression vector would be used to 
transform a host cell, for instance a bacterial cell, 
to enable it to produce an Fv fragment or a Fab 
fragment. The Fv or Fab fragment would then be 
built into a monoclonal antibody by attaching con- 
stant domains and expressing it in mammalian 
cells. 

The present invention is now described, by 
way of example only, with reference to the accom- 
panying drawings in which: 

Figure 1 shows a schematic representation 
of the unrearranged and rearranged heavy and light 
chain variable genes and the location of the prim- 
ers; 

Figure 2 shows a schematic representation 
of the M13-VHPCR1 vector and a cloning scheme 
for amplified heavy chain variable domains; 

Figure 3 shows the sequence of the lg vari- 
able region derived sequences in M13-VHPCR1; 



Figure 4 shows a schematic representation 
"of the M13-VKPCR1 vector and a cloning scneme 
for light chain variable domains: 

" Figure 5 shows the sequence of the lg vari- 
s able region derived sequences in M13-VKPCR1; 

Figure 6 shows the nucleotide sequences of 
the heavy and light cnain variable domain encoding 
sequences o' MAb MBri: 

Figure 7 shows a schematic representation 
jo of the pSV-gpt vector (also known as a-lys 30) 
which contains a variable region cloned as a 
Hindlll-BamHI fragment, which is excised on in- 
troducing the new variable region. The gene for 
human IgGl h£s also been engineered to remove a 
is BamHI site, such that the BamHl site in the vector 
is unique; 

Figure 8 shows a schematic representation 
of the pSV-hygro vector (also known as a-Lys 17). 
It is derived from pSV gpt vector with the gene 
20 encoding mycophenoiic acid replaced by a gene 
coding for hygromycin resistance. The construct 
contains a variable gene cloned as a Hindill-BamHl 
fragment which is excised on introducing the new 
variable region. The gene for human C* has also 
25 been engineered to remove a BamHI site, such that 
the BamHI site in the vector is unique; 

Figure 9 shows the assembly of the mouse: 
human MBn chimaeric antibody; 

Figure 10 shows encoded amino acid se- 
30 quences of 48 mouse rearranged VH genes; 

Figure 11 shows encoded amino acid se- 
quences of human rearranged VH genes: 

Figure 12 shows encoded amino acid se- 
quences of unrearranged human VH genes; 
35 Figure .13 shows the sequence of pan of the 

plasmid pSWl: essentially the sequence of a pec- 
tate lyase leader linked to VHLYS in pSWi and 
cloned as an Sphl-EcoRI fragment into pUCi9 and 
the translation of the open reading frame encoding 
40 the pectate lyase leader-VHLYS polypeptide being 
shown; 

Figure 14 shows the sequence of part of the 
plasmid pSW2: essentially the sequence of a pec- 
tate lyase leader linked to VHLYS and to VKLYS. 

<5 and cloned as an Sphl-EcoRl-EcoRI fragment into 
PUC19 and the translation of open reading frames 
encoding the pectate lyase leader-VHLYS and pec- 
tate lyase feader-VKLYS polypeptides being shown; 
Figure 15 shows the sequence of part of the 

so plasmid pSWlHPOLYMYC which is based on 
pSWl and in which a potylinker sequence has 
replaced the variable domain of VHLYS. and acts 
as a cloning site for amplified VH genes, and a 
peptide tag is introduced at the C-terminal end; 

s5 Figure 16 shows the encoded amino acid 

sequences of two VH domains derived from mouse 
spleen and having lysozyme binding activity, and 
compared with the VH domain of the Dl.3 anti- 
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body. The arrows mark the points of difference 
between the two VH domains: 

Figure 17 shows the encoded amino acid 
sequence of a VH domain derived from human 
peripheral blood lymphocytes and having lysozyme 
binding activity; 

Figure 18 shows a scheme for generating 
and cloning mutants of the VHLYS gene, which is 
compared with the scheme for cloning natural rep- 
ertoires of VH genes: 

Figure 19 shows the sequence of part of the 
vector pSW2HP0LY; 

Figure 20 shows the sequence of part of the 
vector pSW3 which encodes the two linked VHLYS 
domains: 

Figure 21 shows the sequence of the VHLYS 
domain and pe!B leader sequence fused to the 
alkaline phosphatase gene; 

Figure 22 shows the sequence of the vector 
pSWIVHLYSVKPOLYMYC for expression of a rep- 
ertoire of V* light chain variable domains in associ- 
ation with the VHLYS domain; and 

Figure 23 shows the sequence of VH domain 
which is secreted at high levels from £ colt. The 
differences with VHLYS domain are marked. 



PRIMERS 

In the Examples described below, the following 
oligonucleotide- primers, or mixed primers were 
used. Their locations are marked on Figure i and 
sequences are as follows: 

VHlFOFt 5' TGAGGAGACGGTGACCGTGGTCCC- 
TTGGCCCCAG 3 : : 

VH1FOR-2 5' TGAGGAGACGGT- 

GACCGTGGTCCCTTGGCCCC 3: 
HulVHFOR 5' CTTGGTGGAGGCTGAGGAGACG- 
GTGACC 3': 
Hu2VHFOR 
GTGACC 3; 
Hu3VHFOR 
GTGACC 3'; 

Hu4VHFOR 5 CTTGGTGGATGCTGATGAGACGG- 
TGACC 3; 

MOJH1FOR 5' TGAGGAGACGGTGACCGTGGTC- 
CCTGCGCCCCAG 3; 

MOJH2FOR 5' TGAGGAGACGGTGACCGTGGTG- 
CCTTGGCCCCAG 3*; 

MOJH3FOR 5' TGCAGAGACGGTGACCAGAGTC- 
CCTTGGCCCCAG 3*: 

MOJH4FOR . 5' TGAGGAGACGGTGACCGAGGT- 
TCCTTGACCCCAG 3'; 

HUJHl FOR 5* TGAGGAGACGGTGACCAGGGTG- 
CCCTGGCCCCAG 3; 

HUJH2FOR 5' TGAGGAGACGGTGACCAGGGTG* 
CCACGGCCCCAG 3': 

HUJH4FOR 5' TGAGGAGACGGTGACCAGGGT- 
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5' CTTGGTGGAGGCTGAGGAGACG- <0 
5' CTTGGTGGATGCTGAGGAGACG- 
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TCCTTGGCCCCAG 3 ; 

VK1FOR 5* GTTAGATCTCCAGCTTGGTCCC Z ^ 
VK2FOR 5' CGTTAGATCTCCAGCTTGGTCCC 3'; 
VK3FOR 5' CCGTTTCAGCTCGAGCTTGGTCCC 
3'; 

MOJK1FOR 
3'; 

MOJK3FOR 
3'; 

MOJK4FOR 
3': 

HUJK1FOR 

3*; 

HUJK3FOR 
3'; 

HUJK4FOR 
3'; 

HLUK5FOR 
3'; 

VH1BACK 

(G'C)AGTC(T/A)GG 3 
5' 



CGTTAGATCTCCAGCTTGGTGCC 
GGTTAGATCTCCAGTCTGGTCCC 



CGTTAGATCTCCAACTTTGTCCC 



CGTTAGATCTCCACCTTGGTCCC 
CGTTAGATCTCCACTTTGGTCCC 
CGTTAGATCTCCACCTTGGTCCC 
CGTTAGATCTCCAGTCGTGTCCC 



-5' AGGT(C/G)(C/A}A(G/A)CTGCAG- 



CAGGTGCAGCTGCAG- 



5 CAGGTGCAGCTGCAG* 
5 GAGGTGCAGCTGCAG- 
5' CAGGTGCAGCTGCAG- 
AGGTGCAGCTGCAGGAGTCAG 



AGGTCCAACTGCAG- 
AGGTGAAGCTGCAG- 



Hu2VHIBACK: 
CAGTCTGG 3: 
HuVHHBACK: 
GAGTCGGG 3': 
Hu2VHillBACK: 
GAGTCTGG 3: 
HuVHIVBACtt 
CAGTCTGG 3; f 
MOVHIBACK 5' 
3': 

MOVHIIABACK 5 AGGTCCAGCTGCAGCA(G/A)- 
TCTGG 3*; 
MOVHIIBBACK 
CAGCCTGG 3* 
; MOVHIIBACK 
GAGTCTGG 3; 

VK1BACK 5' GACATTCAGCTGACCCAGTCTCCA 
3 : 

VK2BACK 5' GACATTGAGCTCACCCAGTCTCCA 
3; 

MOVKIIABACK 5 GATGTTCAGCTGACCCAAAC- 
TCCA 3 

MOVKIIBBACK 5' GATATTCAGCTGACCCAGGAT- 
GAA 3; 

HuHeplFOR 5' C(A/G)(C/G)- 

TGAGCTCACTGTGTCTCTCGCACA 3 ; 
HuOctal BACK 5 C6TGAATATGCAAATAA 3 : # 
Hupcta2BACK 5' AGTAGGAGACATGCAAAT 3: 
and 

HuOctaSBACK 5' CACCACCCACATGCAAAT 3 ; 
VHMUT1 5' GGAGACGGTGACCGTGGTCCCTTG- 
GCCCCAGTAGTCAAG 

NNNNNNNNNNNNCTCTCTGGC 3* (where N is a/i 
equimolar mixture of T, C, G and A) 
M13 pRIMER 5' AACAGCTATGACCATG 3 (New 
England Biolabs *1201) 
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EXAMPLE .1 



.... Jidmer was used in place of the VHlFOR primer. 



Cloning of Mouse Rearranges Variable region 
geneslrom hybrioomas. assembly of genes encod- 
jnlT Thimaeric antibodies and ine expression of 
antibodies from myeloma cells 

VHlFOH is designed to anneal with the 3 end 
of the sense strand of any mouse heavy chain 
variable domain encoding sequence. It contains a 
BstElt recognition site. VK1FOR is designed to 
anneal with the 3* end of the sense strand of any 
mouse kappa-type light chain variable domain en- 
coding sequence and contains a Bglll recognition 
site. VH1BACK is designed to anneal with the 3 
end of the antisense strand of any mouse heavy 
chain variable domain and contains a Pstl recogni- 
tion site. VKf BACK is designed to anneal with the 
3' end of the antisense strand of any mouse 
kappa-type light chain variable domain encoding 
sequence and contains a Pvull recognition site. 

In this Example five mouse hybridomas were 
used as a source of ds nucleic acid. The 
hybridomas produce monoclonal antibodies (MAbs} 
designated MBM [23]. BW43V26 [24]. BW494/32 
[25]. BW250/183 [24,26] and BW704/152 [27]. MAb 
MBM is particularly interesting in that it is known to 
be specific for a saccharide epitope on a human 
mammary carcinoma line MCF-7 [28]. 



Cloning via mRNA 

Each of the five hybridomas referred to above 
was grown up in roller bottles and about 5 x 10 s 
cells of each hybridoma were used to isolate RNA. 
mRNA was separated from the isolated RNA using 
cligodT cellulose [29]- First strand cDNA was syn* 
thesised according to the procedure described by 
Maniatis et al. [30] as set out below. 

In order to clone the heavy chain variable do- 
main encoding sequence, a 50 ul reaction solution 
which contains 10 ug mRNA. 20 pmole VH1FOR 
primer. 250 uM each of dATP, dTTP. dCTP and 
dGTP. 10 mM dithiothreitol (DTT). 100 mM 
Tris.HCI. 10 mM MgClz and 140 mM KCI. adjusted 
to pH 8.3 was prepared. The reaction solution was 
heated at 70* C for ten minutes and allowed to cool 
to anneal the primer to the 3' end of the variable 
domain encoding sequence in the mRNA. To the 
reaction solution was then added 46 units of re- 
verse transcriptase (Anglian Biotec) and the solu- 
tion was then incubated at 42 'C for 1 hour to 
cause first strand cDNA synthesis. 

in order to clone the light chain variable do- 
main encoding sequence, the same procedure as 
set out above was used except that the VK1 FOR 



Amplification from RNA'DNA hybrid 

5 

Once the ds RNA'DNA hybrids had been pro- 
duced, the variable domain encoding sequences 
were amplified as follows. For heavy chain variable 
domain encoding sequence amplification, a 50 ul 

10 reaction solution containing 5 ul of the ds 
RNA/DNA hybrid-containing solution. 25 pmole 
each of VH1 FOR and VH1BACK primers. 250 uM 
of dATP. dTTP. dCTP and dGTP. 67 mM Tris.HCI. 
17 mM ammonium sulphate. 10 mM MgClj. 200 

:5 ug/ml gelatine and 2 units Taq polymerase (Cetus) 
was prepared. The reaction solution was overlaid 
with paraHin oil and subjected to 25 rounds of 
temperature cycling using a Techne PHC-1 prog- 
rammable heating block. Each cycle consisted of 1 

20 minute and 95' C (to denature the nucleic acids). 1 
minute al 30* C (to anneal the primers to the nu- 
cleic 'acids) and 2 minutes at 72* C (to cause 
elongation from the primers). After the 25 cycles, 
the reaction solution and the oil were extracted 

25 twice with ether, once with phenol and once with 
phenol/CHCI3. Thereafter ds cDNA was precipitat- 
ed with ethanol. The precipitated ds cDNA was 
then taken up in 50 ul of water and frozen. 

The procedure for light chain amplification was 

30 exactly as described above, except that ' the 
VK1FOR and VK1 BACK primers were used in 
place of the VHiFOR and VH1BACK primers re- 
spectively. 

5 ul of each sample of amplified cDNA was 
35 fractionated on 2% agarose gels by electrophoresis 
and stained with ethidium bromide. This showed 
that the amplified ds cDNA gave a major band of 
the expected size (about 330 bp). (However the 
band for VK DNA of MBM was very weak. It was 
<o therefore excised from the gel and reamplified in a 
second round.) Thus by this simple procedure, 
reasonable quantities of ds DNA encoding the light 
and heavy chain variable domains of the five MAbs 
were produced. 

Heavy Chain Vector Construction 

A BstEll recognition site was introduced into 
so the vector M13-HuVHNP 131] by site directed 
mutagenesis [32.33] to produce the vector Ml 3- 
VHPCR1 (Figures 2 and 3). 

Each amplified heavy chain variabte domain 
encoding sequence was digested with the restric- 
55 tion enzymes Pstl and BstEll. The fragments were 
phenol extracted, purified on 2% low melting point 
agarose gels and force cloned into vector Ml 3- 
VHPCR1 which had been digested with Pstl and 
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BstEII and purified on an 0.6% agarose gel. Clones 
containing the variable domain inserts were iden- 
tified directly by sequencing [34] using primers 
based in the 3 non-coding variable gene in tne 
M13-VHPCR1 vector. 

There is an internal Pstl site in the heavy cr-.ain 
variable domain encoding sequences of BW43V25. 
This variable domain encoding sequence was 
therefore assembled in two steps. The 3 Pstl- 
BstEll fragment was first cloned into M13-VHPCR1. 
followed in a second step by the 5 Pstl fragment. 



Light Chain Vector Construction 

Vector Ml3mpl8 [35] was cut with Pvul! arc 
the vector backbone was blunt ligated to a syn- 
thetic Hindlil-BamHI polylinker. Vector M13-HuV- 
KLYS [36] was digested with Hindlll and BamHl to 
isolate the HuVKLYS gene. This Hindltl-Barr.Hi 
fragment was then inserted into the Hindlll-Barr.Hi 
polylinker site to form a vector M13-VKPCR1 which 
lacks any Pvull sites in the vector backbone 
(Figures 4 and 5). This vector was prepared in E 
Coli JM110 [22] to avoid dam methylation at the 
Bell site. 

Each amplified light chain variable domain en- 
coding sequence was digested with Pvull and Belli. 
The fragments were phenol extracted, purified en 
2% low melting point agarose gels and force clon- 
ed into vector M13-VKPCR1 which had been di- 
gested with Pvull and Bell, purified on an 0.3% 
agarose gel and treated with calf intestinal 
phosphatase. Clones containing the light chain vari- 
able region inserts were identified directly by- 
sequencing [34] using primers based in the 3 ncn* 
coding region of the variable domain ir. the Ml 3- 
VKPCR1 vector. 

The nucleotide sequences of the MBrl heavy 
and light chain variable domains are shown in 
Figure 6 with part of the flanking regions of the 
M13-VHPCR1 and M13-VKPCR1 vectors. 



Antibody Expression 

The Hindlll-BamHI fragment carrying the M3n 
heavy chain variable domain encoding sequence in 
M13-VHPCR1 was recloned into a pSV-gpt vector 
with human -y1 constant regions [37J (Figure 7). 
The MBM light chain variable domain encoding 
sequence in M13-VKPCR1 was recloned as a 
Hindlll-BamHI fragment into a pSV vector. PSV- 
hyg-HuCK with a hygromycin resistance marker 
and a human kappa constant domain (Figure 8). 
The assembly of the genes is summarised in Fig- 
ure 9. 

The vectors thus produced were linearised with 



Pvul (in the case of the pSV-hygro vectors the Pvul 
digest is only partial) and cotr2nsfected into the 
non-secreting mouse myeloma line NSO [38] by 
electrcporation [39j. One day after cotransfection. 

5 cells were selected in 0.3 ug'ml mycophenolic acid 
(MPA) and after seven days in lug/ml MPA. After 
14 days, lour wells, each containing one or two 
major colonies, were screened by incorporation of 
'*C-iysine (40) and the secreted antibody detected. 

i0 after precipitation with protein-A SepharoseTU - 
(Pharmacia) on SDS-PAGE [41]. The gels were 
stained, fixed, soaked in a fluorographic reagent. 
Amplify TW (Amersham). dried and autoradiograph- 
ed on pref lashed film at -70* C for 2 days. 

15 Supernatant was also tested for binding to the 

mammary carcinoma line MCF-7 and the colon 
carcinoma line HT-29. essentially as described by 
Menard et al. [23]. either by an indirect im- 
munoflorescence assay on cell suspensions (using 

?o a fluorescein-labelied goat anti-human lgG 
(Amersham)) or by a solid phase RIA on mon- 
olayers of fixed cells (using ,2S l-protein A 
(Amersham)). 

tt was found that one of the supernatants from 

25 the four wells contained secreted antibody. The 
chimeric antibody in the supernatant, like the par- 
ent mouse MBr1 antibody, was found to bind to 
MCF-7 cells but not the HT-29 cells, thus showing 
thai the specificity had been properly cloned and 

30 expressed. 



Example 2 

35 

Cloning of rearranged variable genes from genomic 
DNA of mouse spleen 



40 Preparation of DNA from spleen. 

The DNA from the mouse spleen was prepared 
in one of two ways (although other ways can be 
used). 

Method 1. 

A mouse spleen was cut into two pieces and 
50 each piece was put into a standard Eppendort tube 
with 200 ul of PBS. The tip of a 1 ml glass pipette 
was closed and rounded in the blue flame of a 
Bunsen burner. The pipette was used to squash 
the spleen piece in each tube. The cells thus 
55 produced were transferred to a fresh Eppendorf 
tube and the method was repeated three times 
until the connective tissue of the spleen appeared 
white. Any connective tissue which has been trans- 
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ferred with.ihe celis-^as removed using_a. l^yn : . 
out Pasteur pipette. The cells were then washed in 
PBS and distributed into four tubes. 

The mouse spleen cells were then sedi-.emed 
by a 2 minute spin in a Microcentaur centrifuge at 
low speed setting. All the supernatant was as- 
pirated with a drawn out Pasteur pipette. If cesired. 
z\ this point the cell sample can be frozen and 
stored at -20 C 

To the cell sample (once thawed if it hac been 
frozen) was added 500 ul of water and 5 ul of a 
10% solution of NP-40. a non-ionic detergent. The 
tube was closed and a hole was punched in the lid. 
The tube was placed on a boiling water bath for 5 
minutes to disrupt the cells and was then cooled on 
ice for 5 minutes. The tube was then spun for 2 
minutes at high speed to remove cell debris. 

The supernatant was transferred to a new tube 
and to this was added 125 ul 5M NaCi ana 30 ul 
1M MOPS adjusted to pH 7.0. The DNA in the 
supernatant was absorbed on a Quiagen 5 tip and 
purified following the manufacturer's instructions for 
lambda DNA. After isoprocano! precipitation, the . 
DMA was resuspended in 500 ul water. 



Method 2. 

This method is based on the technicue' de- 
scribed in Maniatis et ai: [30]. A mouse spieen was 
cut into very fine pieces and put into a 2 ml glass 
homogeniser. The cells were then freed from the 
tissue by several slow up and down strokes with 
the piston. The cell suspension was made in 500 u! 
phosphate buffered saline (PBS) and transferred to 
an Eppenaorf tube. The cells were then soun for 2 
min at low speed in a Micrccentaur centrifuge. This 
results in a visible separation of white and red 
cells. The white cells, sedimenting slower, form a 
layer on top of the red cells. The supernatant was 
carefully removed and spun to ensure that all the 
white cells had sedimented. The layer of white 
cells was resuspended in two portions of 500 ul 
PBS and transferred to another tube. 

The white cells were precipitated by spinning 
in the Microcentaur centrifuge at low speed for one 
minute. The cells were washed a further two times 
with 500 ul PBS, and were finally resuspended in 
200 ul PBS. The white cells were added to 2.5 ml 
25 mM EOTA and 10 mM Tris.Cl. pH 7.4, and 
vortexed slowly. While vortexing 25 ul 20% SDS 
was added. The cells lysed immediately and the 
solution became viscous and clear. 100 ul of 20 
mg/ml proteinase K was acded and incubated one 
to three hours at 50 *C. 

The sample was extracted with an equal vol- 
ume of phenol and the same volume of chloroform, 
and vortexed. After centrifuging. the aqueous phase 



_ _ w ?s removed and 1/10 volume 3M ammonium ace- 
tate was added. This was overlaid with three vol- 
umes of cold ethanol and the tube rocked carefully 
until the DNA strands became visible. The DNA 
s was spooled out with a Pasteur pipette, the ethanol 
allowed to drip off. and the DNA transferred to 1 ml 
of 10 mM Tris.Cl pH 7.4. 0.1 mM EOT A in an 
Eppendorf tube. The DNA was allowed to dissolve 
in the cold overnight on a roller. 

10 

Amplification from genomic DNA. 

The DNA solution was diluted 1/10 in water and 
15 boiled for 5 min prior to using the polymerase 
chain reaction (PCR). For each PGR reaction, typi- 
cally 50-200 ng of DNA were used. 

The heavy and light chain variable domain en- 
coding sequences in the genomic DNA isolated 
20 from the human PBL or the mouse spleen cells 
was then amplified and cloned using the general 
protocol described in the first two paragraphs of 
the section headed "Amplification from RNA'DNA 
Hybrid" in Example 1. except that during the an- 
25 neaiing part of each cycle, the temperature was 
held a: 65 *C and that 30 cycles were used. Fur- 
thermore, to minimise the annealing between the 3 
ends of the two primers, the sample was first 
heated to 95* C then annealed at 65" C. and only 
30 then was the Taq polymerase added. At the end of 
the 30 cycles, the reaction mixture was held at 
60 *C for five minutes to ensure that complete 
elongation and renaturation of the amplified frag- 
ments had taken place, 
as • The primers used to amplify the mouse spleen 
genomic DNA were VHlFOR and VH1BACK. for 
the heavy chain variable domain and VK2FOR and 
VK1BACK. for the tight chain variable domain. 
(VK2FOR only differs frorn VKlFOR in that it has 
410 an extra C residue on the 5 end.) 

Other sets of primers, designed to optimise 
annealing with different families of .mouse VH and 
V* genes were devised and used in mixtures with 
the primers above. For example, mixtures of 
as VKlFOR. MOJK1FOR. MOJK3FOR and 
MOJK4FOR were used as forward primers and 
mixtures of VK1BACK. MOVKIIABACK and MOV- 
KIIBBACK as back primers for amplification of V« 
genes. Likewise mixtures of VHlFOR. MOJHlFOR. 
so MOJH2FOR, MOJH3FOR and MOJH4FOR were 
used as forward primers and mixtures of 
VH1BACK, MOVHIBACK. MOVHUABACK. 
MOVHIIBBACK, MOVHII1BACK were used as back- 
ward primers (or amplification of VH genes. 
55 AH these heavy chain FOR primers referred to 

above contain a BstEH site and all the BACK prim- 
ers referred to above contain a Pstl site. These 
light chain FOR and. BACK primers referred to 
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above all contain Balll and Pvutl sites respectively- 
Light chain primers (VK3FOR anci VK2BACK) were 
also devised which utilised different restriction 

sites. Sad and Xhol. 

Typically all these primers yielced amplified 
DNA of the correct size on gel electrophoresis, 
although other bands were also present. However, 
a problem was identified in which the 5 and 3 
ends of the forward and backward primers for the 
VH genes were partially complementary, and this 
could yield a major band of "primer-dimer" in 
which the two oligonucleotides prime on each oth- 
er. For this reason an improved forward primer. 
VHlFOR-2 was devised in which the two 3 
nucleotides were removed from VHiFOR. 

Thus, the preierred amplification conditions for 
mouse VH genes are as follows: the sample was 
made in a volume of 50-100 ul. 50-100 ng of DNA. 
VHiFOR-2 and VHlBACK primers (25 pmole of 
each). 250 uM of each deoxynudeotide 
triphosphate. 10 mM Tris.HCI. pH 8.S. 50 mM KCI. 
1.5 mM MgCli. and lOO.ug/ml gelatine. The sam- 
ple was overlaid with paraffin oil. heated tc^95 C 
for 2 min, 65* C for 2 min. and then to 72 C: taq 
polymerase was added after the sample had 
reached the elongation temperature and the reac- 
tion continued for 2 min at 72* C. The sample was 
subjected to a further 29 rounds of temperature 
cycling using the Techne PHC-1 programmable 
heating block. 

The preferred amplification conditions for 
mouse Vk genes from genomic DNA are as fol- 
lows- the sample treated as above except with V* 
primers, for example VK3FOR and VK2BACK. and 
using a cycle of 94* C for one minute. 60* C for 
one minute and 72* C for one minute. 

The conditions which were devised for 
genomic DNA are also suitable for amplification 
from the cDNA derived from mRNA from mouse 
spleen or mouse hybridoma. 

Cloning and analysis of variable region genes 

The reaction mixture was then extracted twice 
with 40 ul of water-saturated diethyl ether. This 
was followed by a standard phenol extraction and 
ethanol precipitation as described in Example 1. 
The DNA pellet was then dissolved in 100 al 10 
mM Tris.CI. 0.1 mM EDTA. 

Each reaction mixture containing a light chain 
variable domain encoding sequence was digested 
with Sad and Xhol (or with Pvull and Bglll) to 
enable it to be ligated into a suitable expression 
vector. Each reaction mixture containing a heavy 
chain variable domain encoding sequence was di- 
gested with Pstl and BstEII for the same purpose. 

The heavy chain variable genes isolated as 



above from a mouse hyperimmunised with 
lysozyme were cloned into M13VHPCR1 vector 
and sequenced. The complete sequences of 48 VH 
gene clones were determined (Figure 10). All but 

5 two of the mouse VH gene families were repre- 
sented, with frequencies of: VA (1). IIIC O). "IB (8). 
I1IA (3). MB (17). HA (2). IB (12). IA (4). In 30 clones, 
the D segments could be assigned to families SP2 
(14), FL16 (11) and Q52 (5). and in 38 clones the 

;0 JH minigenes to families JH1 (3). JH2 (7). JH3 (14) 
and JH4 (14). The different sequences of CDR3 
marked out each of the 48 clones as unique. Nine 
pseudogenes and 16 unproductive rearrangements 
were identified. Of the clones sequenced. 27 have 

is open reading frames. 

Thus the method is capable of generating a 
diverse repertoire of heavy chain variable genes 
from mouse spleen DNA. 
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Example 3 

Cloning of rearranged variable genes from mRNA 
Irom human peripheral blood lymphocytes 



Preparation of mRNA. 

Human peripheral blood lymphocytes were pu- 
rified and mRNA prepared directly (Method 1). or 
mRNA was prepared after addition of Epstein Barr 
virus (Method 2). 



Method 1. 



20 ml of heparini sed human blood from a 
healthy volunteer was diluted with an equal volume 

<o of phosphate buffered saline (PBS) and distributed 
equally into 50 ml Falcon tubes. The blood was 
then underlayed with 15ml Ficoll Hypaque 
(Pharmacia 10-A-001-07). To separate the lym- 
phocytes from the red blood cells, the tubes were 

.5 spun for 10 minutes at 1800 rpm at room tempera- 
ture in an IEC Centra 3E table centrifuge. The 
peripheral blood lymphocytes (PBL) were then col- 
lected from the interphase by aspiration with a 
Pasteur pipette. The cells were diluted with an 

so equal volume of PBS and spun again at 1500 rpm 
for 15 minutes. The supernatant was aspirated, the 
cell pellet was resuspended in 1 ml PBS and the 
cells were distributed into two Eppendorf tubes. . 



55 



Method 2. 

40 ml human blood from a patient with HIV in 
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tne pre-AlDS con^ -as layered on J^L£ . 
separate the white celis (see Metnod i above) The 
white cells were then incubated in tissue culture 
Medium for ,-5 days. On cay 3. they were infected 
with Epstein Barr virus. The cel!s were pelleted 
tapprox 2 x 10' cells) and washed in PBS. 

The cells were pelleted acain and lysed w.tn 7 
ml 5M guanidine isothiocyanate. 50 mM Tr.s. 10 
mM EDTA, 0.1 mM dithiothreitol. The cells were 
vortexed vigorously and 7 volumes ol i 4M UCI 
added The mixture was incubated at 4 C tor is- 
20 hrs The suspension was spun and the super- 
natant' suspended in 3M LiC. and centniuged 
aaain The pellet was dissolved in 2ml 0.1 - 
,5 mM Tris HCI and 1 mM EDTA. The suspension 
was frozen at -20'C. and thawed by vortexmg or 
20 s every 10 min for 45 min. A large white pellet 
was left behind and the dear supernatant was 
extracted with phenol chloroform, then with chlo- 
roform The RNA was precipitated by add.ng M0 
volume 3M sodium acetate and 2 vol ethanol and 
,eaving overnight at -20" C. The pellet was sus- 
pended in 0.2 ml water and reprec.pitated w>th 
ethanol. Aliquots for cDNA synthesis were taken 
from the ethanol precipitate which had been vor- 
texed tc create a fine suspension. 

100 ul of the suspension was precip.tated and 
dissolved in 20 ul water for cDNA synthes.s [30] 
using 10 pmole of a HUFOR primer (see below) .n 
nnal volume of 50 ul. A sample of 5 ul of the cDNA 
was amplified as in Example 2 except us.ng the 
primers for ine human VH gene families (see be- 
low) using a cycle of 95* C. 60 C and 72 C. 

The back primers for the amplification of hu- 
man DNA were designed to match the available 
human heavy and light chain sequences, m which 
the different families have, slightly d.fierent 
nucleotide sequences at the 5 end. Thus for the 
. human VH genes, the primers Hu2VHIB*Qt 
HuVHIIBACK. HU2VHHIBACIC and HuVH VBACK 
were designed as back primers, and HUJHlFOR. 
HUJH2FOR and HUJH4FOR as forward primers 
based entirely in the variable gene. Another set of 
forward primers HuWHFOR. Hu2VHFOR 
Hu3VHFOR. and Hu4VHF0R was also used, which 
were designed to match the human- J-regions and 
the S end of the constant regions of different 
. human isotopes. 

Using sets of these primers it was possible to 
demonstrate a band of amplified ds cDNA by gel 
electrophoresis. 

One such experiment was analysed in detail to 
establish whether there was a diverse repertoire in 
a patient with HIV infection. It is known that dur.ng 
the course of AIDS, that T-celis and also amibod.es 
are greatly diminished in the blood. Presumably 
the repertoire of lymphocytes is also diminished. In 
this experiment, for the forward priming, an 
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eouimolar mixture- of primers ^ 1VHF ° R r 
1Tu2VHFOR. Hu3VHFOR. and Hu<VHFOR (m P^R 
25 pmole of primer 5 ends) was used. For the 
back priming. the primers HuZVHlBACK 
HuVHIIBACK. Hu2VHIIlBACK and HuVHlVBACK 
were used separately in four separate primings. 
The amplified DNA Irom the separate primings was 
then pooled, digested with restriction enzymes Pstl 
and BstEll as above, and then cloned into the 
vector M13VHPCR1 for sequencing. The se- 
quences reveal a diverse repertoire (Fig. u> at tn.s 
stage of the disease. 

For human V, genes the primers HuJKlFOR. 
HUJK3FOR. HUJK4FOR and HUJK5FOR were 
used as forward primers and VK1BACK as back 
primer. Using these primers it was possible to see 
a band of amplified ds cDNA of the correct s.ze by 
gel electrophoresis. 



Example 4 



Cloning of unrearranged variable gene gnomic 
2S DNAi^humanT^ 

Human peripheral blood lymphocytes of a pa- 
ticnt with non-Hodgkins lymphoma were prepared 
as in Example 3 (Method M The. genomic DNA 
30 was prepared from the PBL using the technique 
described in Example 2 (Method 2). The VH region 
in the isolated genomic DNA was then amplrfted 
and cloned using the general protocol describee m 
me first two paragraphs of the section headed 
M "Amplification, from RNA-'DNA hybrid m Example 
i above, except that during the annea ng pan of 
each cycle, the temperature was held at 55 C and 
that 30 cycles were used. At ^ f"d o the 30 
cycles, the reaction mixture was held at 60 C for 
« Ze minutes to ensure that complete elongation 
and renaturation of the amplified fragments had 

^^Thrfcrward primer used was HuHeplFOR, 
which contains a Sad site. This primer is designed 

45 to anneal to the 3 end of the 

VH region gene, and in particular includes a se- 
puence complementary to the last three codons m 
the VH region gene and nine nucleotides down- 
stream of these three codons. 

As the back primer, an equimolar mixture of 

50 HuOc a BACK HuOcta2BACK and HuOcta3BACK 
w abused. These primers annea. to . .sequence ,n 
the promoter region of the genomic DNA .W I gene 
(see Fioure 1). 5al of the amplified DNA was 

5S checked on 2% agarose gels in TBE buffer and 
stained wUh ethidium bromide. A double band was 
se n of about 620 nucleotides which corresponds 
o he i*e expected for the unrearranged VH gene. 
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The ds cDNA was digested with Sad and cloned 
into an Ml 3 vector for sequencing. Although there 
are some sequences which are identical, a range of 
different prearranged human VH genes were iden- 
tified (Figure 12}. 



Example 5 

Cloning Variable Domains with Binding Activities 
from a Hybndoma 

The heavy chain variable domain (VHLYS) of 
the 01.3 (anti-lysozyme) antibody was cloned into 
a vector similar to that described previously [42] 
but under the control of the lac z promoter, such 
that the VHLYS domain is attached to a pelB leader 
sequence for export into the periplasm. The vector 
was constructed by synthesis of the pelB leader 
sequence [43], using overlapping oligonucleotides, 
and cfoning into a pUC 19 vector (35). The VHLYS 
domain of the Di.3 antibody was derived from a 
cDNA clone [44] and the construct (pSW1) sequen- 
ced (Figure 13). 

To express both heavy and light chain variable 
domains together, the light chain variable region 
(VKLYS) of the D1.3 antibody was introduced into 
the pSWl vector, with a pelB signal sequence to 
give the construct pSW2 (Figure 14). 

A strain o! E. coli (BMH71-18) 145] was then 
transformed [46.47] with the ptasmid pSWl or 
pSW2. and colonies resistant to ampicillin (100 
ug/ml) were selected on a rich (2 x TY '= per litre 
of water. i6g Bacto-tryptone. lOg yeast extract. 5g 
NaCi) plate which contained 1% glucose to repress 
the expression of variable domain(s) by catabolite 
repression. 

The colonies were inoculated into 50 ml 2 x TY 
(with 1% glucose and 100 ug/ml ampicillin) and 
grown in flasks at 37 *C with shaking tor 12-16 hr. 
The cells were centrifuged. the pellet washed twice 
with 50 mM sodium chloride, resuspended in 2 x 
TY medium containing 100 ug/ml ampicillin and 
the inducer IPTG (1 mM) and grown for a further 
30 hrs at 37* C. The cells were centrifuged and the 
supernatant was passed through a Nalgene filter 
(0.45 urn) and then down a 1 - 5 ml lysozyme- 
Sepharose affinity column. (The column was de- 
rived by coupling lysozyme at 10 mg/ml to CN3r 
activated Sepharose.) The column was first washed 
with phosphate buffered saline (PBS), then with 50 
mM diethylamine to elute the VHLYS domain (from 
pSWl) or VHLYS in association with VKLYS (from 
pSW2). 

The VHLYS and VKLYS domains were iden- 
tified by SDS pclyacrylamide electrophoresis as 
the correct size. In addition. N-terminal sequence 
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determination of VHLYS and VKLYS isolated from a 
polyacryiamide gel showed that the signal peptide 
had been produced correctly. Thus both the Fv 
fragment and the VHLYS domains are able to bind 
to the lysozyme affinity column, suggesting that 
both retain at least some of the affinity of the 
oriqina! antibody. 

" The size of the VHLYS domain was compared 
by FPLC with that of the Fv fragment on Superose 
12. This indicates that the VHLYS domain is a 
monomer. The binding of the VHLYS and Fv frag- 
ment to lysozyme was checked by ELISA. and 
equilibrium and rapid reaction studies were carried 
out using fluorescence quench. 

The ELISA for lysozyme binding was under- 
taken as follows: 

(1) The plates (Dynatech Immulon) were 
coated with 200 ul per well of 300 ug/ml lysozyme 
in 50 mM NaHCOj. pH 9.6 overnight ar room 
tempeature; - 

(2) The wells were rinsed with three washes 
of PBS. and blocked with 300 ul per well of 1% 
Sainsbury's instant dried skimmed milk powder in 
PBS for 2 hours at 37* C; 

(3) The wells were rinsed with three washes 
of PBS and 200 ul of VHLYS or Fv fragment 
(VHLYS associated with VKLYS) were added and 
incubated for 2 hours at room temperature; 

(4) The wells were washed three times with 
0.05% Tween 20 in PBS and then three times with 
PBS to remove detergent; 

(5) 200 ul of a suitable dilution (1:1000) of 
rabbit polyclonal antisera raised against the FV 
fragment in 2% skimmed milk powder in PBS was 
added to each well and incubated at room tem- 
perature for 2 hours; 

(6) Washes were repeated as in (4): 

(7) 200 ul of a suitable dilution (1:1000) of 
goat anti-rabbit antibody (ICN Immunochemicals) 
coupled to horse radish peroxidase, in 2% 
skimmed milk powder in PBS. was added fo each 
well and incubated at room temperature for 1 hour; 

(8) Washes were repeated as in (4); and 

(9) 200 Ul 2.2'azino-bis(3-ethylbenz- 
thiazolinesulphonic acid) [Sigma] (0.55 mg/ml. with 
1 ul 20% hydrogen peroxide: water per 10 ml) was 
added to each well and the colour allowed to 
develop for up to 10 minutes at room temperature. 

The reaction was stopped by adding 0.05% 
sodium azide in 50 mM citric acid pH 4.3. ELISA 
plates were read in a Titertek Muttiscan plate read- 
er Supernatant from the induced bacterial cultures 
of both pSWl (VHLYS domain) or pSW2 (Fv frag- 
ment) was found to bind to lysozyme in the ELISA. 

The purified VHLYS and Fv fragments were 
titrated with lysozyme using fluorescence quench 
(Perkin Elmer LS5B Luminescence Spectrometer) 
to measure the stoichiometry of binding and the 



35 



EP 0 368 684 A1 



36 



affinity constant fotigspzyme [48.49]. The JiUSlion . 
of the Fv "fragment at a concentration of" 30 nM 
indicates a dissociation constant ot 2.8 nM using a 
Scatcharc analysis. 

A similar analysis using fluorescence quench 
and a Scatchard plot was carried out for VHLYS. at 
a VHLYS concentration of 100 nM. The stoichiome- 
try of antigen binding is about 1 mole of tysozyme 
per mole of VHLYS (calculated from plot). (The 
concentration of VH domains was calculated from 
optical density at 280 nM using the typical extinc- 
tion coefficient for complete immunoglobulins.) Due 
to possible errors in measuring low optica! den- 
sities and the assumption about the extinction co- 
efficient, the stoichiometry was also measured 
more carefully. VHLYS was titrated with lysozyme 
as above using fluorescence quench. To determine 
the concentration of VHLYS a sample of the stock 
solution was removed, a known amount of nor- 
- leucine added, and the sample subjected to quan- 
titative amino acid analysis. This snowed a 
stoichiometry of 1.2 mole of tysozyme per mole of 
VHLYS domain. The dissociation constant was cal- 
culated at about 12 nM. 

The on-rates for VHLYS and Fv fragments with 
lysozyme were determined by stopped-flow analy- 
sis (HI Tech Stop Flow SHU machine) under 
pseudo-first order conditions with the fragment at a 
ten fold higher concentration than lysozyme [50]. 
The concentration of lysozyme binding sites was 
first measured by titration with lysozyme using 
fluorescence quench as above. The on rates were 
calculated per mole of binding site (rather than 
amount of VHLYS protein). The on-rate for the Fv 
fragment was found to be 2.2.x 10 s M" 1 s"' at 
25' C. -The on-rate for the VHLYS fragment found 
to be 3.8 x 10 s NT' s" 1 and the off-rate 0.075 s - ' 
at 20* C. The calculated affinity constant is 19 nM. 
Thus the VHLYS binds to lysozyme with a dis- 
sociation constant of about 19 nM. compared with 
that of the Fv of 3 nM. 



Example 6 



Cloning complete variable domains with binding 
activities from mRNA or DNA of antibooysecreting 
cells 

A mouse was immunised with hen egg white 
lysozyme (100 ug i.p. day 1 in complete Freunds 
adjuvant), after 14 days immunised i.p. again with 
100 ug lysozyme with incomplete Freunds ad- 
juvant, and on day 35 i.v. with 50 ug lysozyme in 
saline. On day 39. spleen was harvested. A second 
mouse was immunised with keyhole limpet 
haemocyanin (KLH) in a similar way. The DNA was 



prepared from the spleen according to Example 2 
' (Method 2). The VH genes were amplified accord- 
ing to the preferred method in Example 2. 

Human peripheral blood lymphocytes from a 

s patient infected with HIV were prepared as in Ex- 
ample 3 (Method 2) and mRNA prepared. The VH 
genes were amplified according to the method 
described in Example 3. using primers designed 
for human VH gene families. 

,o After the PCR. the reaction mixture and oil 

were extracted twice with ether, once with phenol 
and once with phenol'CHCIj. The double stranded 
DNA was then taken up in 50 ul of water and 
frozen. 5 ul was digested with Pstl and BstEII 

I9 (encoded within the amplification primers) and 
loaded on an agarose gel for electrophoresis. The 
band of amplified DNA at about 350 bp was ex- 
tracted. 

20 

Expression of anti-lysozyme activities 

The repertoire of amplified heavy chain vari- 
able " domains (from mouse immunised with 
25 lysozyme and from human PBLs) was then cloned 
directly into the expression vector 
pSWlHPOLYMYC. This vector is derived from 
P SW1 except that the VHLYS gene has been re- 
moved and replaced by a polylinker restriction site. 
30 A sequence encoding a peptide tag was inserted 
(Figure 15). Colonies were tooihpicked into 1 ml 
cultures. After induction (see Example 5 for de- 
tails) 10 ul of the supernatant from lourteen 1 ml 
cultures was loaded on SDS-PAGE gels and the 
35 proteins transferred electrophoreticaiiy to nitrocel- 
lulose. The blot was probed with antibody 9E10 
directed against the peptide tag. 

The probing was undertaken as follows. The 
nitrocellulose filter was incubated in 3% bovine 
co serum albumin (BSAVTBS buffer for 20 min (10 x 
TBS buffer is 100 mM Tris.HCL pH 7.4. 9% w/v 
NaCI) The filter was incubated in a. suitable dilution 
of antibody 9E10 (about 1/500) in 3% BSA/TBS for 
1 - 4 hrs. After three washes in TBS (100 ml per 
4S wash, each wash for 10 min). the filler was in- 
cubated with 1:500 dilution of anti-mouse ant.body 
(peroxidase conjugated anti-mouse Ig (Dakopats)) 
in 3% BSA/TBS for 1 - 2 hrs. After three washes .n 
TBS and 0.1% Triton X-100 (about 100 ml per 
so wash, each wash for 10 min). a solution containing 
10 ml chloronapthol in methanol (3 mg/ml). 40 ml 
TBS and 50 ul hydrogen peroxide solution was 
added over the blot and allowed to react for up to 
10 min. The substrate was washed out with excess 
55 water. The blot revealed bands similar m mobility 
to VHLYSMYC on the Western blot, showing that 
other VH domains could be expressed. 

Colonies were then toothpicked individually into 
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wells of an EL1SA plate (200 ul) for growth and 
induction. They were assayed fo: lysozyme binding 
with the 9E10 antibody (as in Examples 5 and 7). 
Wells with lysczyme-binding activity were iden- 
tified. Two positive wells (of 200) were identified 
from the amplified mouse spleen DNA and one well 
from the human cDNA. The heavy chain variable 
domains were purified on a column of lysozyme- 
Sepharose. The affinity for lysozyme of the clones 
was estimated by fluorescence quench titration as 
>50nM. The affinities of the two clones (VH3 and 
VH8) derived from the mouse genes were also 
estimated by stop flow analysis (ratio of k on .k 0 „} as 
12 nM and 27 nM respectively. Thus both these 
clones have a comparable affinity to the VHLYS 
domain. The encoded amino acid sequences of of 
VH3 and VH8 are given in Figure 16. and that of 
the human variable domain in Figure 17. 

A library of VH domains made from the mouse 
immunised with lyso2yme was screened for both 
lysozyme and keyhole limpet haemocyanin (KLH) 
binding activities. Two thousand colonies were 
toothpicked in groups of five into wells of EUSA 
plates, and the supernatants tested for binding to 
lysozyme coated plates and separately to KLH 
coated plates. Twenty one supernatants were 
shown to have lysozyme binding activities and two 
to have KLH binding activities. A second expres- 
sion library, prepared from a mouse immunised 
with KLH was screened as above. Fourteen super- 
natants had KLH binding activities and a single 
supernatant had lysozyme binding activity. 

This shows that antigen binding activities can 
be prepared from single VH domains, and that 
immunisation facilitates the isolation of these do- 
mains. 



Example 7 

Cloning variable domains with binding activities by 
mutagenesis. 

Taking a single rearranged VH gene, it may be 
possible to derive entirely new antigen binding 
activities by extensively mutating each of the 
CDRs. The mutagenesis might be entirely random, 
or be derived from pre-existing repertoires of 
CDRs. Thus a repertoire of CDR3s might be pre- 
pared as in the preceding examples by using 
"universal" primers based in the flanking se- 
quences, and likewise repertoires of the other 
CDRs (singly or in combination). The CDR reper- 
toires could be stitched into place in the flanking 
framework regions by a variety of recombinant 
DNA techniques. 

CDR3 appears to be the most promising region 



for mutagenesis as CDR3 is mere variable in size 
and sequence than CDRs 1 and 2. This region 
would be expected to make a major contribution to 
antigen binding. The heavy chain variable region 
s (VHLYS) of the ami-lysozyme antibody Di.3 is 
known to make several important contacts in the 
CDR3 region. 

Multiple mutations were made in CDR3. The 
polymerase chain reaction (PCR) and a highly ce- 
io generate primer were used to make the mutations 
and by this means the original sequence of CDR3 
was destroyed, (it would also have been possible 
to construct the mutations in CDR3 by cloning a 
mixed oligonucleotide duplex into restriction sites 
f5 flanking the CDR or by other methods of site- 
directed mutagenesis). Mutants expressing heavy 
chain variable domains with affinities for lysozyme 
were screened and those with improved affinities or 
new specificities were identified. 
20 The source of the heavy chain variable domain 

was an Ml 3 vector containing the VHLYS gene. 
The body of, the sequence encoding the variable 
region was amplified using the polymerase chain 
rea:tion (PCR) with the mutagenic primer VHMUT1 
25 based in CDR3 and the M13 primer which is based 
in the Mi 3 vector backbone. The mutagenic primer 
hypermutates the central four residues of CDR3 
(Arg-Asp-Tyr-Arg). The PCR was carried out lor 25 
cycles on a Techne PHC-1 programmable heat 
30 block using 100 ng single stranded M13mpl9SW0 
template, with 25 pmoi of VHMUTl and the Ml 3 
primer. 0.5 mM each dNTP. 67mM Tris.HCI. pH 
8.8. 10 mM MgCl2. 17 mM (NHO2SO4, 200 ug/ml 
gelatine and 2.5 units Taq polymerase in a final 
35 volume of 50 ul. The temperature regime was 
95* C for 1.5 min. 25* C for 1.5 min and 72* C for 3 
min (However a range of PCR conditions could be 
used). The reaction products were extracted with 
phenol/chloroform; precipitated with ethanol and re* 
40 suspended in 10 mM Tris. HCI and 0.1 mM EDTA, 
pH 8.0. 

The products from the PCR were digested with 
Pstl and BstEII and purified on a 1.5% LGT 
agarose gel in Tris acetate buffer using Geneclean 
45 (Bio 101. LaJolla). The gel purified band was light- 
ed into pSW2HPOLY (Figure 19). (Th'S vector is 
related to pSW2 except that the body of the 
VHLYS gene has been replaced by a polylinker.) 
The vector was first digested with BstEII and Pstl 
so and treated with calf-intestinal phosphatase. Al- 
iquots of the reaction mix were used to transform 
E. coli BMH 71-18 to ampicillin resistance. Colo- 
nies were selected on ampicillin (100 ug/ml) rich 
plates containing glucose at 0.8% w/v. 
55 Colonies resulting from transfection were pic- 

ked in pools of five into two 96 well Corning micro- 
tia plates, containing 200 ul 2 x TY medium and 
100 ul TY medium. 100 ug/ml ampicillin and 1% 
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clucose. The colon^-vere. grown for 24_houiS. at . 
37* C and 'then ceils were Washed twice "in 200 ul 
50 mM NaCl. pelleting the cells in an IEC Centra-3 
bench top centrifuge with microtitre plate head 
fitting. Pistes were spun at 2.500 rpm lor 10 min at 
room temperature. Cells were resuspended in 200 
Ul 2x TY. 100 ug'ml ampicillin and 1 mM IPTG 
(Sigma) to induce expression, and grown for a 
further 24 hr. 

Cells were spun down and the supernatants 
used in ELISA with lysozyme coated plates and 
anti-idiotypic sera (raised in rabbits against the Fv 
fragment of the D1.3 antibody). Bound ant.- 
idiotypic serum was detected using horse radish 
peroxiease conjugated to anti-rabbit sera (ICN Im- 
munochemicals). Seven of the wells gave a positive 
result in the ELISA. These pools were restreaked 
for single colonies which were picked, grown up. 
induced in microtitre plates and rescreened in the 
ELISA as above. Positive clones were grown up at 
the 50 ml scale and expression was induced. Cul- 
ture supernatants were purified as in Example 5 on 
" columns of lysozyme-Sepharose and eluates an- 
alysed on SDS-PAGE and staining with Page Blue 
90 (BDH). On elution of the column with 
diethylamine, bands corresponding to the VHLYS 
mutant domains were identified, but none to the 
VKLYS domains. This suggested that although the 
mutant domains could bind to lysozyme. they 
could no longer associate with the VKYlS domains. 

For seven clones giving a positive reaction in 
ELISA.- plasmids were prepared and the VKLYS 
gene excised by cutting with EcoRl and relating. 
Thus the plasmids should only direct the expres- 
sion of the VHLYS mutants. 1.5 ml cultures were 
grown and induced for expression as above. The 
cells were spun down and supernatant shown to 
bind lysozyme- as above. (Alternatively the am- 
plified mutant VKLYS genes could have been clon- 
ed directly into the pSWlHPOLY vector for expres- 
sion of the mutant activities in the absence of 
VKLYS.) 

An ELISA method was devised in which the 
activities of bacterial supernatants for binding of 
lysozyme (or KLH) were compared. Firstly a vector 
was devised for tagging of the VH domains at its 
C-termina! region with a peptide from the c-myc 
protein which is recognised by a monoclonal anti- 
body 9E10. The vector was derived from pSWt by 
a BstEU and Smal double digest, and ligation of an 
oligonucleotide duplex made from 
5' GTC ACC GTC TCC TCA GAA CAA AAA CTC 
ATC TCA GAA GAG GAT CTG AAT TAA TAA 3 
and 

5 TTA TTA ATT CAG ATC CTC TTC JGA GAT 
GAG TTT TTG TTC TGA GGA GAC G 3. 
The VHLYSMYC protein domain expressed after 
induction was shown to bind to lysozyme and to 



the 9E10 antibody by ELISA as follows: 

(1) Faicon (3912) "st bcnomeC weils were 
coated with ISO ul lysozyme (3 mg/ml) or KLH (50 
Ltg/rnl) per weii in 50 mM NeHC03. pH 9.6. and left 

5 to stand at room temperature overnight; 

(2) The wells were washed with PBS and 
blocked for 2 hrs at 37' C with 200 ul 2% Sains- 
bury's instant dried skimmed milk powder in PBS 
per well; 

ro (3) The Blocking section was discarded, and 

the walls washed out with P3S (3 washes) and 150 
ul test solution (supernatant or purified tagged do- 
main) pipeaed into each we!!. The sample was 
incubated at 37* C for 2 hrs: 
I5 (4) The test solution was discarded, and the 

wells washec out with PES (3 washes). 100 ul of 4 
ug/ml purified 9E10 antibody in 2% Sainsbury's 
instant dried skimmed milk powder in PES was 
added, and incubated at 37* C for 2 hrs; 
20 (5) The 9E10 antibody was discarded, the 

wells washed with PBS (3 washes). 100 ul of 1/500 
dilution of anti-mouse sr.tioody (peroxidase con- 
jugated anti-mouse Ig (Da'<cpats)) was added and 
incubated at 37* C for 2 h:s; 
25 (6) Trie second antibody was discarced and 

wells washed three times with PBS; and 

(7) 100 ul 2.2azino-bis(3-eihylbenz- 
thiazolinesulphonic acid) (Sigma] (0.55 mg.-m!. with 
1 ul 20% hydrogen peroxide: water per 10 ml) was 
30 added to each well and the colour allowed to 
develop for up to 10 minxes at room temperature. 

The reaction was stopped by adding 0.05% 
sodium azide in 50 mM citric acid. pH 4.3. ELISA 
plates were read in an Tiiertek Multiscan plate 

35 reader. 

The activities of the mutant supernatants were 
compared with VHLYS supernatant by competition 
with the VHLYSMYC domain for binding to 
lysozyme. The results show that supernatant from 
ao clone VHLYSMUT59 is more effective than w.ld 
type VHLYS supernatant in competing for VHLYS- 
MYC. Furthermore. Western blots, of SDS-PAGE 
aliquots of supernatant from the VHLYS and 
VHLYSMUT59 domain (using anti-Fv antisera) in- 
45 dicated comparable amounts of the two samples. 
Thus assuming identical amounts of VHLYS and 
VHLYSMUT59. the affinity of the mutant appears lo 
be greater than that of the VHLYS domain. 

To check the affinity of the VHLYSMUT59 do- 
so main directly, the clone was grown at the II scale 
and 200-300 ug purified on lysozyme-Sepharose 
as in Example 5. By fluorescence quench titration 
of samples of VHLYS and VHLYSMUT59. the num- 
ber of binding sites for lysozyme were determ.ned. 
55 The samples of VHLYS and VHLYSMUT59 were 
then compared in the competition ELISA with 
VHLYSMYC over two orders of magnitude. In the 
competition assay each microtitre well contained a 
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constant amount of VHLYSMYC (approximately 0.6 
ug VHLYSMYC). Varying amounts of VHLYS or 
VHLYSMUT59 (3.B uM in Iysc2yme binding sites) 
were added (0.165 - 25 ul). The final volume and 
buffer concentration in all wells was constant. 9E10 
(anti-myc) antibody was used to quantitate bound 
VHLYSMYC in each assay well. The % inhibition of 
VHLYSMYC binding was calculated for each addi- 
tion of VHLYS or VHLYSMUT59. after subtraction 
of background binding. Assays were carried out in 
duplicate. The results indicate that VHLYSMUT59 
has a higher affinity for lysozyme than VHLYS. 

The VHLYSMUT59 gene was sequenced (after 
recloning into Ml 3) and shown to be identical to 
the VHLYS gene except for the central residues of 
COR3 (Arg-Asp-Tyr-Arg). These were replaced by 
Thr-Gln-Arg-Pro: (encoded by ACACAAAGGCCA). 

A library of 2000 mutant VH clones was 
^) screened for lysozyme and also for KLH binding 
{toothpicking 5 colonies per well as described in 
Example 6). Nineteen supernatants were identified 
with lysozyme binding activities and four with KLH 
binding activities. This indicates that new 
specificites and improved affinities can be derived 
by making a random repertoire of CDR3. 



Example 8 

Construction and expression of double domain for 
lysozyme binaing. 

The finding that single domains have excellent 
binding activities should allow the construction of 
strings of domains (concatamers). Thus, multiple 
specificities could be built into the same molecule, 
} allowing binding to different epitopes spaced apart 
by the distance between domain heads. Flexible 
linker regions could be built to space out the do- 
mains. In principle such molecules could be de- 
vised to have exceptional specificity and affinity. 

Two copies of the cloned heavy chain variable 
gene of the Di.3 antibody were linked by a 
nucleotide sequence encoding a flexible linker 
Gly-Gly-Gly-Ala-Pro-AIa-Ala-Ata-Pro-Ala-Gly-Gly- 
Gly- 

(by several steps of cutting, pasting and site di- 
rected mutagenesis) to yield the plasmid pSW3 
(Figure 20). The expression was driven by a lacz 
promoter and the protein was secreted into the 
periplasm via a pelB leader sequence (as de- 
scribed in Example 5 for expression of pSWl and 
PSW2). The protein could be purified to homo- 
geneity on a lysozyme affinity column. On SDS 
polyacrylamide gels, it gave a band of the right 
size (molecular weight about 26.000). The protein 
also bound strongly to lysozyme as detected by 



ELtSA (see Example 5) using anti-idiotypic an- 
tiserum directed against the Fv fragment of the 
D1. 3 antibody to detect the protein. Thus, such 
constructs are readily made and secreted and at 
5 least one of the domains binds to lysozyme. 



Example 9 

jo 

Introduction of cysteine residue at C-terminal end 
. of VHLYS 

A cysteine residue was introduced at the C- 

15 terminus of the VHLYS domain in the vector pSW2. 
The cysteine was introduced by cleavage of the 
vector with the restriction enzymes Bstl and Smal 
(which excises the C-terminal portion of the J seg- 
ment) and ligation of a shon oligonucleotide duplex 

20 5' GTC ACCGTC TCC TCA TGT TAA TAA 3' and 
5* TTA TTA ACA TGA GGA GAC G 3*. 
By purification on an affinity column of lysczyme 
Sepharose it was shown that the VHLYS-Cys do- 
main was expressed in association with the VKLYS 

25 variable domain, but the overall yields were much 
tower than the wild type Fv fragment. Comparison 
of non-reducing and reducing SDS polyacrylamide 
gels of the purified Fv-Cys protein indicated that 
the two VH-Cys domains had become linked 

30 through the introduced cysteine residue. 



Example 10 

35 

Linking of VH domain with enzyme 

Linking of enzyme activities to VH domains 
should be possible by either cloning the enzyme 

40 on either the N-terminal or the C-termina! side of 
the VH domain. Since both partners must' be ac- 
tive, it may be necessary to design a suitable linker 
(see Example 8) between the two domains. For 
secretion of the VH-enzyme fusion, it would be 

4$ preferable to utilise an enzyme which is usually 
secreted. In Figure 21. there is shown the se- 
quence of a fusion of a VH domain with alkaline 
phosphatase. The alkaline phosphatase gene was 
cloned from a plasmid carrying the £. coti alkaline 

so phosphatase gene in a plasmid pEK48 [51 J using 
the polymerase chain reaction. The gene was am- 
plified with the primers 

5' CAC CAC GGT CAC CGT CTC CTC ACG GAC 
ACC AGA AAT GCC TGT TCT G 3' and 
ss 5' GCG AAA ATT CAC TCC CGG GCG CGG TTT 
TAT TTC 3*. The gene was introduced into the 
vector pSW1 by cutting at BstEll and Smal. The 
construction (Figure 21) was expressed in E. coti 
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strain BMH7i.l8^s^«rople 5. and scre_ene-Jor . 
phosphatase activity using 1 mg/mi p-nitrcphenyl- 
phosphate as substrate in lOmM diethanclamme 
and 0.5 mM MgCl*. pH 9.5) and also or, SOS 
potyacrylamide oels which had been Western blot- 
ted (dieting with anti-idiotypic antiserum). No evi- 
dence was lound (or the secretion of the linked 
VHLYS-attaline phosphatase as detected by West- 
ern blc-.s {see Example 5). or lor secretion of 
phosphatase activity. 

However when the construct was transacted 
into a bacterial strain BL21DE3 [52] which is defi- 
cient in proteases, a band of the correct s;ze (as 
well as degraded products) was detected on the 
West°m blots. Furthermore phosphatase activity 
could now be detected in the bacterial supernatant. 
Such activity is not present in supernatant from the 
strain which had not been transfected with the 
construct. 

A variety of linker sequences could men be 
introduced at the BstEM site to improve the spacing 
between the two domains. 



Example 1J_ 

Coexpression of VH domains with Vk repertoire 

A repertoire of V* genes was derived by PCR 
using primers as described in Example 2 from 
DMA prepared from mouse spleen and also from 
mouse spleen mRNA using the primers VK3FOR 
and VK2BACK and a cycle of 94 C for 1 mm. 
60* C for l min. 12 C for 2 min. The PCR am- 
plified DNA was fractionated on the agarose gel. 
the band excised and cloned into a vector which 
carries the VHLYS domain (from the Dl.3 anti- 
body) and a cloning site (Sad and Xhol) for clon- 
ing of the light chain variable domains w.tn a myc 
tail (pSWWHLYS-VKPOLYMYC. Figure 22). 

Clones were screened for lysozyme binding 
activities as described in Examples 5 and 7 via the 
myc tag on the light chain variable domain, as this 
should permit the following kinds of V* domains to 
be identified: . 

(1) those which bind to lysozyme in the 
absence of the VHLYS domain; 

(2) those which associate with the heavy 
chain and make no contribution to binding of 

lysozyme; and 

(3) "those which associate with the heavy 
chain and also contribute to binding of lysozyme 
(either helping or hindering). 

This would not identify those V, domains 
which associated with the VHLYS domain and com- 
pletely abolished its binding to lysozyme. 

In a further experiment, the VHLYS domain was 



placed by tne heavy chain variable domain VH3 
*■" which had been isolated from the repertoire (see 

Example 6). and then the V. domains cloned into 

the vector. (Note that the VH3 domain has an 
5 internal Sad site and this was first removed to 

allow the cloning of the V* repertoire as Sad-Xhol 

fragments.) 

By screening the supernatant using the ELISA 
described in Example 6. bacteria! supernatant will 
to be identified which bind lysozyme. 



Example 12 



High expression of VH domains. 

By screening several clones from a VH library 
derived from a mouse immunised with lysozyme 
20 via a Western blot, using the 9E10 antibody di- 
rected against the peptide tag. one clone was 
noted with very high levels of express.on of tne 
domain (estimated as 25 - 50 mg/l). The done was 
sequenced to determine the nature of the se- 
25 quence. The sequence proved to be closely related 
to that of me VHLYS domain, except w.th a few 
amino acid changes (Figure 23). The result was 
unexpected, and shows that a limited number of 
amino acid changes, perhaps even a single am.no 
30 acid substitution, can cause greatly elevated levels 
ot expression. 

By making mutations of the high expressing 
domain at these residues, it was found that a single 
amino acid change in the VHLYS domain(Asn 35 to 
35 His) is sufficient to cause the domain to be ex- 
pressed at high levels. 



CONCLUSION 
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It can thus be seen that the present invention 
enables the cloning, amplification. and express.on 
of heavy and light chain variable domain encoding 
sequences in a much more simple manner than 
,s was previously possible. It also shows that .soiated 
variable domains or such domains linked to effector 
molecules are unexpectedly useful. 

It will be appreciated that the present invention 
has been described above by way of example only 
so and that variations and modifications may be made 
by the skilled person without departing from the 
scope of the invention. 
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35 Claims 

1 A single domain ligand consisting of at least 
part 01 the variable domain of one cha.n of a 
molecule from the immunoglobulin (lg) ™J»«*™^ 
, 0 2 The ligand ol claim 1. which cons.sts of the 
variable domain of an tg heavy chain. • 

3 The ligand of claim 1. wh.ch consists of the 
variable domain of an lg chain with one or more 
point mutations from the natural sequence. 
, s ' 4. A receptor comprising a ligand 0 any one 0 
claims 1 to 3 linked to one or more of an effector 
molTcule. a prosthetic group, a Mabel, a sol* sup- 
port or one or more other tigands havmg the same 

50 claim 4. comprising at least 

"°e 9 The' receptor of claim 5. -f^dthe 
ti gand binds to a first epitope of an ant.gen and the 
second ligand binds to a second epitope- 

7 The receptor of claim 6. which includes an 
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ecule. proceed by^combinanL DNA technology, - 
25 a fusion product. 

9. The receptor of claim 8. wherein a linker 
peptide sequence is placed between the ligand and 
the other protein molecule. 

10. A method of cloning a sequence (the target 
sequence) which encodes at least part of the vari- 
able domain of an lg superfamily molecule, which 
method comprises: 

(a) providing a sample of double stranded 
(ds) nucleic acid which contains the target se- 
quence; 

(b) denaturing the sample so as to separate 
the two strands; 

(c> annealing to the sample a forward and a 
back oligonucleotide primer, the forward primer be; 
ing specific for a sequence at or adjacent the 3 
end of the sense strand of the target sequence, the 
back primer being specific for a sequence at or _ 
adjacent the 3' end of the antisense strand of the 
target sequence, under conditions which allow the 
primers to hybridise to the nucleic acid at or adja- 
cent the target sequence; 

(d) treating the annealed sample with a DNA 
polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which 
cause primer extension to take place; and 

(e) denaturing the sample under conditions 
such that the extended primers become separated 
from the target sequence. 

11. The method of claim 10. further including 
the step {0 of repeating steps (c) to (e) on the 
denatured mixture a plurality of times. 

12. The method of claim 10 or claim 11. which 
is used to clone a complete variable domain from 
an lg heavy chain. 

13. The method of claim 10 or claim 11 which 
is used to produce a DNA sequence encoding a 
ligand according to any one of claims 1 to 3. 

14. The method of any one of claims 10 to 13. 
wherein the forward and back primers are provided 
as single oligonucleotides. 

15. The method of any one of claims 10 to 13. 
wherein the forward and back primers are each 
supplied as a mixture of closely related 
oligonucleotides. 

16. The method of claim 14 or claim 15. 
wherein the primers which are used are species 
specific general primers. 

17. The method of any one of claims 10 to 16. 
wherein the ds nucleic acid sequence is genomic 
DNA. 

18. The method cf any one of claims 10 to 17. 
wherein the ds nucleic acid is derived from a 
human. 

19. The method of any one of claims 10 to 18. 
wherein the ds nucleic acid is derived from periph- 
eral blood lymphocytes. 



. 20. The method of ar.y one of claims 10 to 18. 

wherein each primer includes a sequence encoding 
a restriction enzyme recognition site. 

21. The methoo cf claim 20. wherein the re- 
5 striction enzyme recognition site is located in the 

sequence which is annealed to tne cs nucleic acid. 

22. The method of any one of claims 10 to 21, 
wherein the product ds cDNA is inserted into an 
expression vector and expressed atone. 

io 23. The method of any one of claims 10 to 22. 

wherein the product ds cDNA is expressed in com- 
bination with a complementary variable domain. 

24. The method of any one of claims 10 to 23. 
wherein the cloned ds cDNA is inserted into an 

is expression vector already containing sequences 
encoding one or more constant domains tc allow 
the vector to express ig-type chains. 

25. The method of any one of claims 10 to 24. 
wherein the cloned ds cDNA is inserted into an 

20 expression vector so that it can be expressed as a 
fusion protein. 

26. The method of claim 10. wherein one or 
both of the primers comprises a mixture of 
oligonucleotides of hypervariabie sequence, where- 

25 by a mixture of variable domain encoding se- 
quences is produced. 

27. A method of cloning a sequence (the target 
sequence) which encodes at least part of the vari- 
able domain of an lg superfamily molecule, which 

jo method comprises: 

(a) providing a sample of double stranded 
(ds) nucleic acid which contains the target se- 
quence; 

(b) denaturing the sample so as to separate 

35 the two strands; 

(c) annealing to the sample a forward and a 
back oligonucleotide primer, the forward primer be; 
ing specific for a sequence at or adjacent the 3 
end of the sense strand of the target sequence, the 

40 back primer being specific for a sequence at or 
adjacent the 3' end of the antisense strand of the 
target sequence, under conditions which allow the 
primers to hybridise to the nucleic acid at or adja- 
cent the target sequence; 

45 (d) treating the annealed sample with a DNA 

polymerase enzyme in the presence of deox- 
ynucleoside triphosphates under conditions which 
cause primer extension to take place; 

(g) treating the sample of ds cDNA with 
so traces of DNAse in the presence of DNA poly- 
merase I to allow nick translation of the DNA; and 

(h) cloning the ds cDNA into a vector. 

28. The method of claim 27, which further 
includes the steps of: 
55 (i) digesting the DNA of recombinant plas* 

mids to release DNA fragments containing genes 
encoding variable domains; and 

(j) treating the fragments in a further set of 
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steps (c) to (h). 

29 The method of either claim 27 cr claim 2B. 
wherein the fragments are separated from the vec- 
tor and from other fragments of the incorrect size 
by gel electrophoresis. 

30 The method of any one of claims 27 to 29. 
wherein the product ds cDNA is cloned directly into 
an expression vector. 

31. A species specific general oligonucleotide 
primer or mixture of such primers useful lor cloning 
at least part of a variable domain encoding se- 
quence from an animal of that species. 

32. A primer or mixture of primers according to 
claim 27, wherein each primer includes a restriction 
enzyme recognition site within the sequence which 
anneals to the coding part of the variable domain 
encoding sequence. 
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VKPCR1. 



HirD III"' 

ALcT-AT^TATGCAAA-CCTCTGAATCTACATGGTAAATATAGGTTrGTCTATAC^ 
10 20 30 <0 50 60 

CAAA^GAA**CATGAGATC^^ 

70 80 90 100 110 120 

HGWSCIIi-FLVATAT 
CATGGGATGGAGCTGTATCATCCTCTTCTTOT^ 

130 140 150 160 1/0 ISO 

AGTAGCAGGCTTGAGGTCTGGACATATATATC^T^^TCACATC^CTTTKCTTTC 
190 200 210 220 230 

PstI 

1 51 10 

GV^'SOVQLQESGPGLVRP 

TCTCCACAGGK^CTCCCAK^^ 

250 260 270 .280 290 300 

CDR1 

t c 20 2S 30 

S OTLS L^CTVSG'STFSSirW« 
CTAGCCAGACCCTGAGCCTGACCTGCACCG 

CDR2 

35 40 45 50 

HWVROPP GRGLEWIGRI D P N 

tt^cactgWgagaca^cacct^ 

370 330 390 400 410 420 

„ go 65 70 

S G G T K Y » £ K F K S K V T H l V D T 

ATAGTGCCGGTACTAAGTACAAT^GAAGTT^^ 

430 440 450 460 470 480 

? S K N Q F 8 S L R L S 8 S V T A A 'D T A V Y 
CCAGCAAGAACCAGTTCAGCCTGAGACTCAGCAGCGTGACAGCCGCCG^ 

490 500 510 520 WO ?™ 

CDR3 

« 100 105 HO „ „ 

Y C A R * D Y Y G S S Y F D Y W G Q G T 
ATTATTGTGCAAGATACGATTACTACGGTAGTAGCTACTTTGACXACTGGGGC 

550 560 510 580 590 600 

BstSIl 
115 I 120 

ccaJgztlaIo^ 

610 620 630 640 650 660 

AGATTTTACTGCATTTGTTGGGGGGGAAATGTGTCTAT 

670 680 690 ?00 ' zo 

CTAGGGACACCTTGGGAGTCAGAAAGGGTCATTGGGAGCCCGGGCTGATC 

730 740 750 760 770 780 

BamHI 
I 

TCCTCAGC TCCC AGACTTCATGGCCAGAGATTTATAG F I "3 

790 800 810 ' 
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^ WPCR1 



~TGAATCTACATGGTAAATA7AGC-7^TC?ATACCA 
58 68 78 68 



CAAACAGAAAAACA7SAGAT:*CAG77CTC^^ 
96 1CB 




Vr-r - rrr • < — ~^TATCIA^CTCTTCTTGGTAjCAACA5CTACA3GTAAGGGGCTCAC 
MC ^i^-" A ^^ 176 185 158- 208 



AGTAGCAGGCTTGAGGTCTGGACATATATAT^GTGACAA^ 

218 228 238 248 258 ZcS 



I 

GVHSDIQLTQSP S S L S 
TCTCCACAGCTTGrCCACTCCGACATC-^^CCCAGAGCCCAA^o.C * — 

276 289 299 308 JlB 3ZB 

CDR1 

15 20 25 30 

VGDRVi-ITCBASCNI** * 
GCGTGGGTGACAGAGTGACCATCACCTGTAGAGCCAGCGGTAACATCtt 

339 349 358 368 J '*» JO ° 

CDK2 

393 . <08 <ie 428 438 

55 60 65 70 

InGVPS^rSGSGSGTDFTFT 

CDR3 

75 80 85 90 r k 5 r 

CCATCA5CAGCC TCCAGC^GAGGACATCGCCACCTACTACTG CCAG^CTTCTG^ 
518 528 538 548 558 

Bel I (requires dan" host) 
I 

95 100 105 108 

Pft'rFGOGTKVVIKR 

CCCCAAGGACGTTCGGCCAAGGGACCAAGGTGGIXILIC^ 

578 588 598 608 618 



Ba.-n.HI 
I 

rTGCTTCCTCAGTTC-gnTCC C" I C 

638 648 r\\J* J 
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Sequence of K3rl VK 

Srliee -I 
i G V K S 
A33TGTCCAC7CC 

1 ?stl 10 20 

Q V Q L Q ESGTELASPGASVTJ, 
CAGGTCC^ACTGCA^GAGTC^GG AACTGAGCTGGCGAGTCCTGGG^^TCAGTGACA^ - G 
VK1BACX SITE 

30 CDR1 , <0 

SCKAS GYTF T I D K 3 I^D K V K K R 
TCCTGCAAGGC 7TCTGGCTACACAT TTACTGACCATATTATAAATTGGGT AAAAAAG.-V&J 

52a 53 OR2 . 

PGQGLEKIG |* i Y P V S G V T N Y| 
CCTGGACAGGGCCTTGAGTGGATTGGAAGGATT7ATCCAGTAAGTGGTGTAACTAACTAC 

60 COR2 65 ~>0 

In o k f y. G i katfsvdrssntvy 

AATOVAWTTCATGGGCAAGGCCACATTCTCrGTAGACCGGTCCTCCAACACAGTC-iAC 

80 . 82A 5 C 83 90 CDW 

MVLNSLTSEDPAVVYCGR L5_rJ 
ATGGTGTTGAACAGTCTGACATCTGAGGACCCTGCTGTCTATTACTGtGGAAGOGGCTTr 

CDR3 1 03 Estill Splice 

I D F D 71 W G 0 G T T V T V S S J, 

rATTTTGACTAC TGGGGCCAAGGGACCACGGTCACCG TCTCCTC^GGr 

VH1FOR SITE 



Sequence of MBrl VK 

Splice -1 
; G V K S 
A3GTGTCCAC7CC 

i pwii io 2 £ 

D I Q L TQSPPSLTVSVGERV^ 
GACATTC^GCTGACCCAGTCTCCn CCATCCCTC^CTGTGTCAGTAGGAGAGAGGGTCACr 

VK13ACX SITE 

21\ 3 C D E T CDR1 

I S C IxSNONLLWSGN RR-YCtGl 
ATCAGTTGCAAATCCAATCAGAATCTTTTATGGAGTGGAAACCGAAGGTACrGTTTGGGC 

35 <0 50 CDR2 

WHQWKPGQTPTPLIT \ W T S D R ) 
TGGCACCAGTGGAAACCAGGGCAAACTCCTACACCGTTGATCACCTGGACATCTGATAGG 

60 ™ 

I F Si GVPDRFIGSGSVTDFTLT 
TTCTCTGGAGTCCCTGATCGTTTCATAGGCAGTGGATCTGTGACAGATTTCACTCTGACC 
80 90 CDR3 

ISSVQAEDVAVYFC O I Q H L D M 
ATCAGCAGTGTGCAGGCTGAAGATGTGGCAGTTTATTTCTGTCAGCAACAlTrGGACCTT 

95 100 Sglll/Bcll Splice 

I P Y Tl F G G G T K L E I X J 
CCGTACACGTTCGGAGG GGGGACCAAGCTGGAGATCAAACG TGAG 
VK1FOR SITE 



FIG. 6 



EP 0 368 Q34 A> 



oc-Lys 30 




BamHI 
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a-Lys 17 




(BamHI) J 



3 EcoRI sites internal 
10 kappa fragment 



2.9 
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Hindm PstI BstEH BamHI 



BamHI 
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HB_1 



rnR l 



UL2 



rrm 2 



KABAT ia 

A*7 ?GLVX?S?SLSL7C5VTGYS:7 

r03 PGLVX?SCSLSLTCS\TGYSiT 
G01 PCLVK?£;SLSLTCSV73ys:7 



SSYYVN 

sgyywi 

SGYYW 
SGYYW 



KISOFPGSXLEKXG yiSYCGSN'NYNPS-XN 

K S?G X? LZ'.^XG YI7HS 5 E7F Y NPS LCS 

*.: IKF PCSXLZVX3 Y i s yogsnkynpslxn 

WIRCFFGNKlZVttG YISYDCSiCNYNPSLXN 



KABAT IB 

ACS FVIA/A?SCS1,SI7CAVS0FSL7 

^3° PGLVA? S~S LSI 7C7VSGF S IT 

G03 PGLVO?s;5LS:TCTVSGrSLT 

nog pvlva??csls:tctvsg?elt 

25C10 ?GLVA?S3SLS;7C7VSG:SL7 

U2 PGLVAPSSSLSITCTVSSrSLT 

AC° PGLVAPSCS'-SITCTVSGFSLT 

25GC6 FGLVAPSCS T -5ITCT\'SGFSLr 

AO 3 FGLVCPSOSLS-TCTVSGFSLT 

CO 7 PVLVA?S3SLS:7CTVSG-SLT 

-D< FGLVA?SCS*-S:tC~ f SGrSL7 



SYGVL 
SYGVH 
SYCVTj 
SYGVK 



SYAIS 

syais 



SY7AO 



SYGVK 
SYGVD 



KV3a?FCXGL£WLG 



VIWAGGITttYNSAl-MS 
VIWSGGSTDYNAAFIS 
3S7NYNSALMS 
G5TDYNAAFIS 
VIKAGGSTNYNSALMS 
V;W7GGG7N*YSSAi.XS 
VIVCGGGTSYNSA'-KS 

csrnvsAiits 

VIXGGGSTSYNSALK5 
VIVfSGGSTSY^^AFIS 
VIWAGGSTSYSSALX5 
VIKGVGS7KY SSALKS 



KABAT IIA 



K57 



?IL'. ! 7<?GVSV;<:SCKGSGY7r7 
K"-VTv?GVSVK: 5CKGS3Y7F7 



CYAXH 



wvxoskaxslewig 
kvkcshaxsuewig 



vistyygdasyncxfkc 
vistyygdastfssxtkc 



KABAT JIB 

ac 2 ailwpc-asv:<i.s;xasgy7?: SYWn 

EOi AILAW?C-ASVK>:SCKASGYTF7 SYh.. 

COS AiLVKPGASVKLSCKASGYrrT 

COS AiLVKPGA5LXLSCKA5GY7F7 

DC 6 ASLVXPGASVKKSCXASGY7F7 

DC3 PELVX?GASV-aSCKASGYT:7 

EC 7 ;£LVH?GASVyiSCKA$GY?~? 

GOB pZLVKPGASVKI SCKASGYTF7 

C-iC AELVKrGASVKVSCKASGYTiT 

2S5C9 AELVfSCASVjKSCXASGY*.-- 

ro< 7ILVKPC-ASVKLSCXASGY7: 7 

*02 AZLVX?GASViC_5CXASGY7r7 

HC1 A£LVM?GA5VKL5CKASGY7F7 

2=CC5 pELVsl?G?SV!C«CXASGYT7F 

EOi A£LVK?GASV:-^3CKASGYTF7 

bos aelv^gssvxlsckdsyfaf:-: 

AILVK?GAS\':-C.SGKASGY7F- SYfcIT 



SYWti 

sywit 
sykxk 

CYEKK 
DYYIN 



SYVJXH 
5YW.H 
SYWK 



SYKIT 



WVSOaFGCGLEKIG 
hViCOaPGRGLEWIS 

kvxsb?G3GUh:g 

KVXCflPGQGLEWXG 
KVKOaFGQGLEWIG 
KVKCTPVHGLEifIG 
WVKCR?5QGIXKIG 

W.'XCNHGXSLEMIG 

KVXOaPGOGLEWIS 
KVXQSF-GQGLIKIG 

KVKQaPSQGlIMG 
KVXCRPGiiGLEKIG 
WXOSFGOGL&CG 



eidpsdsytsysoxfkg 

CIY?G5GS7S'Y:;t:<FKS 
RIDPNSGG7KYSEXFKS 
r:NPSKGG7NY0E:K7K5 
DIYPGSGSTNYNEXFXS 
IINPSKGGTNYNIXFK5 
AlDPrTGGTAYKOKFKG 
KI YPGSGN7KYNEKFKG 

riepsdsdtnykaktkg 
nfkpysd3tkykekfk5 
n:n?ssggtnys*3Xfks 
nidpsdsdthysckfkd 

tZ3?SDSYTr;YN'KVCC 
QZFPASGSlYYNSiXKD 
C I Y?G S 3 S7H YKIK7KS 
S F7XY 50A7E Y SENFXG 
DIYPGSGSTNYNEXTKS 



KABAT III A 



2SG05 

C1C 

E07 



GGLV3.WGS LSLSGAASGFTF7 
CG1VQPGGSLSLSCAA5GFTF7 



DYYKS 
DYYtC* 
OYYMS 



KV?Q??GKA1XV1LG 
WVHCP?GXA1X«LA 
KVR3?PGXAL£K'_A 



"75NKAKGY7TEYSASVKG 
L I "rkkang Y TME Y SASVKG 
LIBNKASGYTTEYSASVXG 



KADAT III D 

CCS GGLVK?GGSLK*-SCAASGF7FS 

E12 CGLVQPGESLKL5CE5NEYEF? 

00* GGLV0?GGSLRLSGAASGF7rS 

D05 GGLVQPGG S LRISCAASGF 77 S 

Fi2 GGLVQPGI5WKLSCVICQ"" 

TO 6 GGLVQPGSSLRLSCAASCFTFS 

D02 GGLVOPGESLKiSCESNEYVI? 

i09 GDIVVPGGSUGSCAASGFTFS 



OYG!<H 
SHDXS 
SYAM3 
SYAMS 

trrtt 

SYAXS 
•KDKS 
SYGKS 



W«3APEKGUSVA 

*W» *« 

KVA'APGXGIT^S 
WA*APGKGL£WVS 

WK'APGKGLEUVS 
hVRGDSSE'ISLVA 
W^?QtFOK?.LEWA 



vTSSGSSTIYYAOTVKG 
AZtJSDGCSTYYPDTMER 
AJSCSGGS TYYADSVKG 
AISGSCCSTYYADSVKC 
AINSDGGSTYrPDTtGR 
AISCSGGSTYYADSAXG 
AIXSDGSSTYYPDIT&I 
T I S SGC S YTYYPD SVKG 



KABAT III C 

E06 GGLVOPGGSM-aSCAASGFTFS 



WKQ ^OSPEKGU:™ E I SNXANN HAT Y Y AEiS VXG 



KABAT V A 

C0( AZ1VKFGASVKLSCKASGY 



EY7IH VsVKQRSGQGLEWIG 



FIG. 10a 



KTYPGSGSIKYSEKFKD 



EP (>368 £84 A\ 



EK_2 



p : s : i rit s :-:N:-r r lcln Sv-rrr c7 axy yc ag d.-ok l ^t.^x 

RIS ITRCTSKNC'rrLr-I^SVTTi^ATYY'AR VSSGYtSKDY 



RLSIS:O7S:<SQV7L:0C)Sw7CC7AyYYCAK 
RL5 I S KDS'SV.SOVrrKXNS lqadcta: yycar 

W-S ! SiONSKSQV-riC^S -QADCTA^ ?VCA3 
RL5 I SrONSKSQV? LK>^S LQ7DD7A. W .Y YCA I 
RLSISKDN£K5a\TLK:^SLCTDC7ASryCA^ 
RLSISKDN5 KSO'^T S L07D C 7ARY YCAR 

RLS I SaDN ; LKXN SLQ7D" 7 A'Of YC AR 

RLSISKDN3KS0VTrv:^3LaACD7A:YrCAR 
RLS;5KDNSKSQV-LKW5LC7CD7A:^YCAK 
K1S IS-ONSKS^VT LK>:NS LC7CC7A.VYYCAS 



LGSGYA>CY 
KR2YCY3SCYYYAMDY 
YYvGSFfAY 
IGYYYrAY 
IYYCGSSDYYAVSY 
13 nt 
21 nt 
2B r.t 
37 nt 
32 nt 



f/U-crcd-JCii* 
:ive 
:ive 
Uixrcsustive 
Ur.procucLive 



KATI-TTVSy.S S S7AY:-2 L^LTSE:: S AVYYCAR 
KA7^^K5SSTAYr£LARL7SEDSAVYYCAS 



AO r.t. 
22 nt. 



Unproductive 
U.-.Drca'Jctlve 



KAT LTVC K £ S S 7AYM7 LS S LT S E 2 S AVY YCVR RGL7Y A-*OY 

KA7LTVSTS3STAYX5LSSLTSICSAVYYCAR YYSN'YTDY 

KA7 L7VDK? £ S TA XKZ ^ S L7S S A\,^ Y C AR ?N***SKY Y YOOV 

KATL7VCKS53TAYKC-LS5LTSID5AVYrC?- LYYYA.*CY 

KA7UVCTS S S TAYXCi-S S L75 ID 5 AVY Y C AR SSGYDY 

KA7LTVDKSSS7AYXC;LSS1/:SIDSAVYYCTI GAARA7MAY 

KA7L7VDK 5 5 STAYKCLS SITS I2SAVYYCAR GGrAY 
KATLTVET5SS7AYKC15S L7S EE? 3 AVY YCAR 
KATLTVOKSSSTAY?^LSSL?StDSAVYYCAI 

KA7LTVTKSSS"^1I1SRLTS5D5AVYY:aR iOYYGSSLfc;AY 

KATL7VD K5 S57AYK3LS S L7 S 10 SAVY YCAX TTWATDY 

KAT LTVDKS S 5TAYK3LS 5 LIS 13 5 AVY YCAR -CC YSTYrCH 

rjX^/uTSSSTArAlSSLTSLOTAVrrCL' 2« .it. 

KATLWCKJ'5D7AYX71.SSL7SH:DSASYYCAR 9 r.t. 

KATLTANTSS5TAYXZLS5L7SEDSA\fYYCAR 23 r.t. 

KA7L7VDTSSS7SYKCLS5L7SEDSA l /YYCAR 15 




RT7I SRDN'SOS ■ L Y !C W ALRAI DSA7 YYC AR YXI LGAXD Y 
RrriSR2NSQS:LY:C:W.LRAiDSA7YYCAR CYYYDSSYYAJGY 
Rr7I5RDNSCSILY-_^'ALRA£DSA7YYCAR 23 r.t. 



Ur.prod'jcclve 



^7IS?rXA, , or:'_rLC:<TSLRSi:D7A>1YYCAR AXTKLYFDY 

PS1ISRDUTKK Tl Yl^VSSlRSZQ TAX YYCAR RLCWZSRLDCDV 

RfTISFlDNSKNTL YL ~-£VJ LFAEC TA VY YCAD ZZLHV'TD? 

KF TI SRDSSKNTL YLQWS LPAZD TA VTYZAK WXSSfTOY 

RniSRWSKKTLYLCXSSUlSZDTAL YYCAR PPXtPSY 

RTT*SXDH$KS7LYLC?&>SLPA£DTAVYYCAK 43 nt. 

RFIlS&tm&TLYLCXSSUlSZDTALYYCAX 29 nt. 

RTT I S ROHAJC»*TL Y LC"& S LKS ZD7AMY YC AR 35 nt. 



Ps.gene 

Ps.cene 

Ps.gene 

Ps.9«ne/i 

Ps.ger.e/ 



R77:SRDD£KSRVYLCMNSLRA£D7GIYYCTG 



3D nt. 



Unproductive 



KA7*TA.DK55STVYXI'_SRL75D0SAVYFCAR KIDRDSSGYAXDY 



FIG. 10b 



EP 0 368 6&4 A1*. 



CD?, 2 



3 



KASAT HUMAN VEl 



STSTAYf"ELRSLRSED7AWYCAR GEGWDI1TDY 
KAOKFGG RVTIRRHXSTSTAYMELSSLRSEDTAVYYCAR GSRYGYDCSGYYYL 
G*m5w R^rTRN-SISTATKELSSLRSEDTAVYYCAH LAKFSGSPVWTDP 



KASAT HUMAN VH2 



KKQLQPSLKS . RV7ISV07SKNQFSLKLSS"»TAADTAVYYCAR GGWPAAIKDV 

KS R^ISVSTSKNOFSLKLSS^AADTAVYYCAK £^?pSS™ 

SLKS RLSISODTSRKQFSLRLSS%*TAADTAVYYCAR hRNW«SPVHFDY 

ESTSTAYKELSSLRSEDTAVYYCAR DSYGDYGGHY 



KASA7 HUMAN VH3 

ISYI7SSSSY7NYADSVKG 
SVKC 
YADSVKG 
YYAOSVRD 
DSVKG 

VSAISGSGGS7Y YADSVKG 
AVI S YDGSNKYYADSVKG 
GAVI S YDGSKKY YADSVKG 

QYAASVKG 



RrriSF^NAKNSLYLQ^JSI^ATDiAVYYCAR 
RTTISRDDSXS LAY LQVN S LK7ED7AVY YCTR 
RFTISR3NAKNSLFL0MSSLRAZD7ATYYCAR 
RT7IS?i;NSKN7LYLQ^SLRAED7AVYYCAK 
RF7 I S RON AXN SLYLQMNS LROED7AVY YCAR 
Rr7ISRONPKN7LYLQW5LRSED7AVYYCAR 
RFTI S RONS K*TLYLQMN5 LRA1D7AVY YCAR 
PiTISROICSKNTLYLQMNSLPAIDTAVYYCAS 
AKNSLYLQMNSLRAZD7AVYYCVR 
RF7I SRDDSKNSLYLQ^SLNTEDTAVYYCVR 



DGRFG7YSPSDY 
7IYYDSSGYPYW • 
GIALDAFDI 

53 NT. UNPROD PZARR 

DHSG7GGGGSGSYF 

KDNLKFD? 

DLGGRGWWPAPGGR5 1 YY YGMOV 

LEGIG7IYYYGKDV 

DDSSSKPKHFQK 

SGWPYLDY 



KNOWN FAMILY 



AVYYCAR DPRIAARPDYYYYMOV 
TAMYYCAR GAEWt ? TARY Y YC LNV 



FIG. 11 



EP 0 S68 664 A1- 



. CP? 1 . 



YTFT SYGIS 

GEKPGSSVTV5CKASGY7F7 DYFKN 

QVQLQEJGPRTGEASETL.SLICAVSGDSIS SGNW'I 

QVQLQESG?GLVK*SETLSL7CTVSGGSIS syyws 

GYTTT NYCMH 

QVQLQESGPGLVKnSETLSLYCAVSGDSIS SC-NW* I 

GPRLGEASETLSLTCTVSGGSIS SSSYYw 

QVQLQiSGPGLVKoSETLSLTCTVSGGSIS SYYWS 

LS1ICAVSGSSIS SGNK-I 

SE7LSLTCAVYGGSFS GYYWS 

QVQLVQSGAZVrKPGASVKVSCKASGYXFT NYCKK 

SETLSLICAVSGDSIS SGNW*I 

SRAQ7GEASETLSL7CTVSGGSIS SSSYYWG 

CPLTCTVSGGSVSSGS YYWS 

GLVKPSETLSLTCTVSGGSIS SYYWS 

SFETLSLICAVSGDSIS SGNW*I 

QVQLVQS G AE\ 7 KK? G S S VKV S C KA S GGTF S SYAIS ■ 

QVQLC^'GAGILKPSETLSLTCAVYGGSFS GYYWS 

QLQLQESGPG1VKPSE7LSL7C7V5GGSI S SSSYYWG 

GPGLVKPSQTLSLTCTVSGGSIS SGGYYKS 



WVTTGPWTRDLRWXG 

wmrqa?gqrlzwi*:g 
wvrqppgkglewig 

WIrcppGKGLEWIG 
WV-RODKAQGLr^MG 
WVRQPFGKGLEWIG 
WIRQPPGKGLEWIG 
WIRQPPGKGLENIG 
WVRQPPGKGLEWIG 
KIRQPPGKGLEWIG 
WVRQVLAQGLEWMG 
KVRQPPGXGLEWI G 
WIRQPPGKGLEKIG 
WIRQPPGKGLEWIG 
WIGSPpGKGLEWIG 

kvrqpfgkglexig 
kvrqap gqglewmg 
wircppgkglewig 
xi rqppgkglewi g 
wirqnpgxglewig 



* indicates stoo cocor. ( unsure as sequence regains in frame) 

• sequence terminates due to internal restriction site 
lower case denotes frane shift ^^^J 



CDR2 , 



! WI S A YNGNTNY AQKXQG 
WINAGNGNTKY SQKLQG 
EIHHSGSTYYWPSLKS 
RIYTSGS7NYNPSLKS 
LVCPSDGSTSYAQKFQA 
EIHHSGSTYYNPSLKS 
EINHSGS7NYNPSLKS 
YIYYSGSTNYNPSLKS 
EIHHSGSTYYNPSLKS 
EINHSGSTNYNPSLKS 
LVCPSDGSTSYAQKFQA 
EIHHSGSTYYNPSLKS 
SIYYSG5TYYNPSLKS 
YIYYSGSTNYNPSLKS 
RIYTSGSTKYNPSLK5 
EIHHSGSTYYNPSLKS 
R2IPILGIANYAQKFQG 
EINHSGSTNYNPSLKS 
E INHSGSTNYNPSLKS 
YIYYSGSTYYNPSLKS 



RVTMTTD7STS7AYMELRSLRSDDTAVYYCAR 

RV7ITRD7SAS7AYXQLSSLRSF-DTAVYYCAR 

RITMSVDTSKNQFYLKLSS • 

RVTI SVDTSKNQFSLKLSSVTAADTAVYYCAR 

RV7ITRDTSMSTAYMELSSLRSEDTAMYYCAR 

RITMSVDTSKNQFYLKLSS- 

RVTI SVDTSKNQF SLKLS S ■ 

RVTI SVDTS KNQF S LKLS S • 

RITMSVDTSKNQFYLKLSS • 

RVTISVDTSKNQFSLKLSSVTAADTAVYYCAR 

RVTI7RDTSMS7AYMELSSLRSEDTAMYYCAR 

RITMSVDTSKNQFYLKLSS • 

RVTIPVDTSKNQFSLKLSS • 

RVTI SVDTSKNQFSLKLSSVTAADTAVYYCAR 

RVTMSVDT S KNQF SLKLS S • 

RITMSVDTSKNQFYLKLSS • 

RV7I7ADKSTSTAYMELSSLRSEDTAVYYCAR 

RVTI SVDTSKNQF SLKLS S • 
RVTI SVDTSKNQF SLKLSS • 
RVTI SVDTSKNQFSLKLS SVTAADTAVYYCAR 



DTVS3 
DTVSS 

DTVSS 
DTVSS 



DTVSS 
DTVSS 



DTVSS 

DTVS 

DTVSS 



FIG. 12 
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pSWl 

KincIII site AAGC77 

MKYLLPTAA 
GCATGCW.TTCTATTTC^GGAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCC 
10 20 30 «0 50 60 

AGLLL* AAQPAMAQVQLQES 
GCTGGATTG7TATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 HO 120 

GPGLVAPSQSLSI-TC T V S G ? 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCA.CG^-AG««x.C 

120 HO 150 160 170 xBO 



SLTGYGVKWVRQPPGKGL E W 
rCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGrtGTGo 
a90 200 210 220 230 240 



T r-wTWGDGNTDVNSALKSRL 
CTGGGAATGATTTGGGGTGA7GGAAACACAGACTATAATTCAGCTCTCAAATC 

250 260 270 2B0 290 300 



SISKDNSKSQVFLKMN S L H T 
AGCA7CAGCAAGGACAACTCCAAGAGCCAAG77T7C7TAAAAATGAACAGT 

310 320 330 340 350 360 

DD-A'RYYCARERDYRLDYW G 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCTTGACTACTGGGGC 

370 380 390 400 410 420 

QGT' TV'f'VSS Smal 
CAAGGCACCACGGTCACCGTCTCCTCATA^TAAGAGCTATOXGi^CTAAGCTCGAATTC 

430 440 <50 460 470 480 



FIG. 13 



EP 0 368 6S4 A1 



pSW2 

Hindi I I AAGCTT 

MKYLLPTAA 
GCATGCAAATTCTATTTCAAGGAGACAGTCAT^^ 

10 20 30 40 50 *>u 

AGLLL - AAQPAMAQVQLQES 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGA7GGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 -L^u 

GPG'LVAPSQSLSIT C X V S G T 
GGACCTGGCCTGGTGGCGCCCTCACACnGCCTGTCCATCACATGvAw^GiCiCrtoooTTu 

130 140 150 160 170 180 

SL T GYGVNWVKQFPGKGLEW 

TCATTAACCGGCTATGGTGTAAAiCTGGGT^ 

!90 2C0 210 220 230 240 

CTSXDKS-<SQVFLKMNSLKT 

Ag\*TCAgWaC^^ 

310 320 330 340 . 350 3o0 

DDTARYYCAKERDYRLDYW G 
GATGACACAGCCAGGTACTACTGrGCCAGAGAGAGAGATTATAGGCrTGACTACT^Go. 

370 380 3S0 400 410 420 

CAAGGCACCACGG^ 

430 440 450 460 470 

MKYLLPTAAAG 
AAATTCTATTTCAAGGAGACAGTCAT^TGWTACCTATTGCCTACGGCAGCCGCTGGA 

490 500 510 520 530 540 

LLLLAAQPAMADIVLTQSPA 
TTGTTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGTCCTGACTCAGTCTCCAGCC 
550 560 570 580 590 600 

slsasvg-tvtitcrasgni 

TCCCTTTCTGCG7CTGTGGGAGAAACTGTCACCATCACATGTCGAGCAAGTGGGAATATT 
610 620 630 640 650 660 

HN V LAWYQQKQGKSPQ LLVY 
CACAATTATT7AGCATGGTATCAGCAGAAACAGGGAAAATCTCC7CAGCTCCTGGTCTAT 

670 680 690 700 710 720 



FIG. 14a 



EP 0 358 68& A1 



V--TLADGVPSRFSGSGSGT 



cLtIttctctcaLa^cagcctgc^^ 



TAAGAG CTCGrtATTC 
910 

FIG-.lAb 



pSV;lH?OLVMYC 

Hindi I I site AAGCTT 

gwtgc^ttctatttcaaggagacmtcat^^tLctattgcct^sgcagcc 

10 ' 20 30 <0 3" 

Polylinkcr 
TCTAGA GTCGAC CTCGAG 
Xbal Sail xhol 

MYC PEPTIDE ^ # 

BstEII 

GGGCTAAGCTCGAATTC 

FIG. 15 



EP 0 368 654 AT . . * *♦ 



YE 3 QVQLQESGPELVKPGASVKMSCKASGYTFT 
VH8 QVQLQESGPSLVKPGASVKMSCKASGYTFT 
VH-D1 3 QVQLKESGPGLVAPSQSLSITCTVSGFSLT 



VH3 
VK8 

VH-D1 .3 



50 



CDR2 



CDR1 

SYVMH 
SYVMH 
GYGW 



YINPYNDGTKYNEXFKG 
YINPYNDGSKYNEKFKG 
MI VI GOGNTDYNSALKS 



t t i 



WVKQKPGAGLSWIG 
KVKQKPGQGLEWIG 
KVRQPPGKGLEWLG 



I9C 



katltsd:<ssstaymelssltsedsavyycav 

KATLTADKSSNTAYMQLSSLTSZDSAVYYCAR 
RLSISKDNSKSQVrLKMNSLKTDDTARYYCAK 



95 CDR3 



VE3 
VE8 

VE-D1.3 



LLLRYFFDY 

GAWSYYAKDY 

ERDYRLDY 



113 

WGQG7TVTVSS 
WGQG7TVTVSS 
WGQGT7LTVSS 



FIG. 16 



FR1 QVQLQESGGGLVQPGGSLRLSCAASGFTFS 




SYAMS 


CDR1 


FR2 WVRQAPGKGLEWVS 




AISGSGGSTYYADSVKG 


. CDR2 


FR3 RFTISRDNSKNTLYLQMNSLRAEDTAVYYCAM 




WRGIATPVSFDLGYFDY 


CDR3 


FIG. 17 



EP 0 368 684 A1 



Pstl 



BstEl 



Pstl 



CDR3 



BstEH 



rearranged VH genes 
from immunised mouse 
spleen DNA 



pSWIHPOLYMYC 




VHD1.3gene 

PCR amplify rearranged VH genes or 
VHD1.3. Excise VH band from gel. 
Clone into vector for expression of VH 
domains in E.coli 



pSWIHPOLY 



repertoire of expressed 
VH domains from spleen 



repertoire of expressed 
VH domains with mutant 
CDR3 regions 



Assay for binding to antigen 



FIG. 18 



\ 
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pSV:2K?0LY 
Hindlll AAGCT7 

MK YLLPTAA 
GCATGCAAATTCTAT~Crj^GG^CACAGTCATAATGAAATACCTArTGCCTACGG-3AGCC 
10 20 30 40 50 60 



AGLLLLAAQPAMAQV QLQ 
GCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAG 
-m Rn 90 100 110 PstI 



TCTAGA GTCGAC CTCGAG 
Xbal Sail Xfcol 

GGTCACCGTCTCC7XATAATAAGAGCTCGAATTCGCCAAGCTTGCATGC 

BstSII 430 440 450 460 470 480 

kkyllptaaag 
^ttctatttcaaggagacagtcata^tg™tacctattgcctacggcagccc<:tgga 

4S0 500 510 520 530 540 

LLLLAAQPAMAD I VLTQSPA 
7TGTTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGTCCTGACTCAGTCTCCAGCC 
550 560 570 580 590 600 

SLSASVGETVTITCRASGNI 
"CCCTTTCTGCGTCTGTGGGAGAAAC7GTCACCATCACATGTCGAGCAAG7GGGAATATT 
6i0 620 630 640 650 660 



HN V L-AWYQQKQG K S PQLLV Y 
ckcAA7TATT7AGCATGG7A7CAGCAGAAACAGGGAAAATCTCCTCAGC7CCTGG7CTAT 
670 680 650 700 710 720 



Y77TLADGVPSRFSGSGSGT 
7ATACAACAACC77AGCAGA7GG7GTGCCA7CAAGGTTCAG7GGCAGTGGA7CAGGAACA 
730 740 750 760 770 780 

QYSLKINSLQPE DFGSYYCQ 
CAATATTCTCTCAAGA7CA^CAGCCTGCAACCTX;AAGATTTTGGGAGTTATTACTGTCAA 
790 800 610 820 830 840 

HFWSTPR7FGGGTKL2IKR 
CA7T7T7GGAGTAC7CCTCGGACGrrCGG7GGAGGCACCAAGCTGGAAATOlAACGGTAA 

850 860 870 880 890 900 



7AAGAGC7CGAA77C 
910 



FIG. 19 



EP 0^68 6&4 All ! 



K K Y L L ? T 
AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 
10 20 30 40 50 60 

AAAGLLLLAAQPAMAQVQLQ 
GCAGCCGCTGGATTGTTATTACTCGC7GCCCAACCAGCGATGGCCCAGGTGCAGCTGCAG 
70 60 90 100 HO 120 

^SGPGLVAPSQSLSITCTVS 

gXgtcaggacctggcctggtggcgccctcacagagcctgtccatcacatgcaccgtctca 

!30 140 150 160 170 180 

g-sltgygvnwvrqp?gkgl 

GGGTTCTCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTG 
190 200 . 210 220 230 240 

eklgmiwgdgntdynsalks 
gagtggctgggaatgatttggggtgatggaaacacagactataattcagctctc^aatcc 

250 260 270 280 290 300 

RLSISKDNSKSQ V F L K M K S l 
AGACTGAGC ATC AGCAAGGACAACTCCAAGAGCCAAGTTTTC . T/\Artrt«.AGA.-iCrtGTC 

310 320 330 340 350 360 

HTDDTARYYCARERDYRLDY 
CACACTGATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGATTATAGGCrTG=,CTAC 
370 380 390 400 410 420 

W GQGTTVTVSSGGGAPAAAP 
TGGGGCCAAGGCACCACGGTCACCGTCTCCTGAGGTGGTGGTGCTCCAGCAGCTGCACCT 
430 440 450 460 470 480 

AGGGQVQLKESGPGLVAPS Q 
GCTGGAGGAGGACAGGTGCAGCTGAAGGAGTCAGGACCTGGCCTGGTGGCGCCC.GAC^G 
490 500 510 520 530 -540 

sls i'tctvsgfsltgygvnw 

AGCCTGTCCATCACATGCACCGTCTCAGGGTTCTCAT7AACCGGCTATGGTGTAAACTGG 
550 560 570 580 o90 600 

VRQPPGKGLEWLGMIWGDGN 
GTTCGCCAGCCTCCAGGAAAGGGTCTGGAGTGGCTGGGAATGATTTGGGGTGATGGAAAC 
610 620 630 640 650 660 

TDYNSAL KSRLSISKDNSKS 

ACAGACTATAATTC*GCTCTCAAATC^ 

670 680 €90 700 710 720 

OVFLKMNSLHTDDTARYYCA 
CAAGTTTTCTTAAAAATGAACAGTCTGCACACTGATGACACAGCCAG 

730 740 750 760 770 780 

RERD V RLDYWGQGTTVTVSS 
AGAGAGAGAGATTATAGGCTTGACTACTGGGGCCAAGGCACCACGGTCACCGTCTCCTCA 
790 800 810 620 830 840 



TAA7AAGAGCTC 
650 



FIG. 20 
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±0 20 30 40 at DU 

AGLLLLAAQPAMAQVQLQES 
GC-GGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCTGCAGGAGTCA 
70 80 90 100 110 l^u 

rprLVAPSQSLSITCTVSGF 
GGACCTGGCCTGGTGGCGCCCTCACAGAGCCTGTCCATCACATGCACCGTCTCAGGGTTC 
130 140 150 160 l'O iou 

SLTGVGVNWVRQPPGKGL E K 
TCATTAACCGGCTATGGTGTAAACTGGGTTCGCCAGCCTCCAGGAAAGGGTCTGGAoiGG 
ISO 200 210 220 230 ^ 

LGM JWGDGNTDYKSALKSRL 
CTGGGAJiTGATTTGGGGTGATGGAAACACAGACTATAATTCAG 

qiSKDNSKSQVFLKMNSLKT 
AGCATCAGCAAGGACAACTCCAAGAGCCAAGTTTTCTTA^AAATGAACAGTCTGCACACT 
310 320 330 340 o50 J^u 

DD* rAR vv C ARERD*RLDVWG 
GATGACACAGCCAGGTACTACTGTGCCAGAGAGAGAGAT7ATAGGCTTGACTACTGGGGC 

370 380 390 400 410 420 

QGTTV7VSSRTPEMPVLE N R 
CAAGGCACCACGGTCACCGTCTCCTCACGGACACCAGAAATGCCTG7TC7GGAA?-ACuoG 
430 440 450 460 470 480 

* tnftDITAPGGARRLTGDQT 

gctgctcLggcgatattactgcacccggcggtgctcgccgtttaacggstgatcagact 

490 500 510 520 5j0 

gccgctctgcgtgattctcttagcgataLcctgca^ 

550 560 570 580 590 t>00 

dgmgdseitaarnyaegagg 
gatgggatgggggactcggaaattactgccgcacgtaattatgccgaaggtgcgggcggc 

.610 620 630 640 650 660 

f^kgidalpltgqythyaln 
tttttta-aaggtatagatgccttaccgcttaccgggcaatacactcactatgcgctgaat 

670 680 690 700 710 1 ™ 

kktgkpd v vtdsaasataws 
a.^^ccggcaaaccggactacgtcaccgactcggctgcatcagcaaccgcctggtca 

730 740 750 760 770 780 

FIG. 21a 
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TGVKTYKGALGVDIHEKDHP 
ACCGGTGTCAAAACCTAT^CGGCGCGCTGGGCGTCGATATTCACG>-^GATCACCCA 
790 600 810 " 820 EjO 54C 

TILEMAKAAGLATGNVS T A E 
ACGATTCTGGAAATGGCAAJAGCCGCAGGTCTGGCGACCGGTAACGiTTCTACCoCAG.n-o 
850 860 670 880 890 900 

LQDATPAALVAHVTSRKCYG 
TTGCAGGATGCCACGCCCGCTGCGCTGGTGGCACATGTGACCTCGCGCAAATGCTACGGT 
91 o 920 930 940 950 960 

PSATSEKCPGNALEKGGKGS 
CCGAGCGCGACCAGTGAAAAATGTCCGGGTAACGCTCTGGAAAAAGGCGGAAAAGGATCG 
S 70 980 990 1000 1010 1020 

ITEQLLNARADVTLGGGA K T 
ATTACCGAACAGCTGCTTAACGCTCGTGCCGACGTTACGCTTGGCGGCGGCGCAAAAACC 

1050 1040 1050 1060 1070 ±060 

F-AE T AT A G E W Q . G KT LREQAC^ 
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